Approximate Query Processing over Static Sets and Sliding Windows

09/14/2018
by   Ran Ben Basat, et al.
0

Indexing of static and dynamic sets is fundamental to a large set of applications such as information retrieval and caching. Denoting the characteristic vector of the set by B, we consider the problem of encoding sets and multisets to support approximate versions of the operations rank(i) (i.e., computing sum_j <= iB[j]) and select(i) (i.e., finding minp | rank(p) >= i) queries. We study multiple types of approximations (allowing an error in the query or the result) and present lower bounds and succinct data structures for several variants of the problem. We also extend our model to sliding windows, in which we process a stream of elements and compute suffix sums. This is a generalization of the window summation problem that allows the user to specify the window size at query time. Here, we provide an algorithm that supports updates and queries in constant time while requiring just (1+o(1)) factor more space than the fixed-window summation algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/23/2018

Sliding Suffix Tree

We consider a sliding window over a stream of characters from some finit...
research
08/27/2020

Cost-based Query Rewriting Techniques for Optimizing Aggregates Over Correlated Windows

Window aggregates are ubiquitous in stream processing. In Azure Stream A...
research
06/11/2023

Time-limited Bloom Filter

A Bloom Filter is a probabilistic data structure designed to check, rapi...
research
01/09/2020

Age-Partitioned Bloom Filters

Bloom filters (BF) are widely used for approximate membership queries ov...
research
03/08/2021

Sliding Window Persistence of Quasiperiodic Functions

A function is called quasiperiodic if its fundamental frequencies are li...
research
12/02/2020

Fast Automatic Feature Selection for Multi-Period Sliding Window Aggregate in Time Series

As one of the most well-known artificial feature sampler, the sliding wi...
research
04/28/2018

Heavy Hitters over Interval Queries

Heavy hitters and frequency measurements are fundamental in many network...

Please sign up or login with your details

Forgot password? Click here to reset