Fast Counting in Machine Learning Applications

04/12/2018
by Subhadeep Karan, et al.

We propose scalable methods for executing counting queries in machine learning applications. To achieve memory and computational efficiency, we abstract counting queries and their context so that the counts can be aggregated as a stream. We demonstrate the performance and scalability of the resulting approach on random queries and through extensive experiments with Bayesian network learning and association rule mining. Our methods significantly outperform the commonly used ADtrees and hash tables, and are practical alternatives for processing large-scale data.
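
To make the term concrete, the sketch below shows what a counting query looks like when answered with a plain hash table, one of the baselines the abstract names. It is not the authors' stream-based method, which the abstract does not describe in enough detail to reproduce. The function name `count_configurations`, the toy dataset, and the tuple-of-column-indices query format are all assumptions made for this illustration.

```python
from collections import Counter
from typing import Hashable, Sequence, Tuple


def count_configurations(
    rows: Sequence[Sequence[Hashable]],
    variables: Tuple[int, ...],
) -> Counter:
    """Count how often each joint assignment of `variables` occurs in `rows`.

    Hypothetical helper: a hash-table baseline, not the paper's method.
    `variables` holds the column indices the query is restricted to.
    """
    # Project each row onto the queried columns and tally the projections.
    return Counter(tuple(row[v] for v in variables) for row in rows)


# Toy dataset (invented for this example): each row is one observation
# of three categorical variables.
data = [
    (0, 1, 0),
    (0, 1, 1),
    (1, 0, 0),
    (0, 1, 0),
]

# Query: how many rows have each (X0, X1) configuration?
counts = count_configurations(data, variables=(0, 1))
```

Bayesian network structure learning and association rule mining both issue many queries of this shape over different variable subsets, which is why the cost of each query matters at scale.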

research
03/01/1998

Cached Sufficient Statistics for Efficient Machine Learning with Large Datasets

This paper introduces new algorithms and data structures for quick count...
research
12/02/2021

A short note on the counting complexity of conjunctive queries

This note closes a minor gap in the literature on the counting complexit...
research
06/01/2020

F-IVM: Learning over Fast-Evolving Relational Data

F-IVM is a system for real-time analytics such as machine learning appli...
research
11/20/2012

A Traveling Salesman Learns Bayesian Networks

Structure learning of Bayesian networks is an important problem that ari...
research
06/13/2015

Graphlet Decomposition: Framework, Algorithms, and Applications

From social science to biology, numerous applications often rely on grap...
research
06/21/2019

Learning to Sample: Counting with Complex Queries

In this paper we present a suite of methods to efficiently estimate coun...
research
10/13/2020

Data Engineering for HPC with Python

Data engineering is becoming an increasingly important part of scientifi...
