Binary Fuse Filters: Fast and Smaller Than Xor Filters

01/04/2022
by   Thomas Mueller Graf, et al.
0

Bloom and cuckoo filters provide fast approximate set membership while using little memory. Engineers use them to avoid expensive disk and network accesses. The recently introduced xor filters can be faster and smaller than Bloom and cuckoo filters. The xor filters are within 23 in storage as opposed to 44 Walzer, we build probabilistic filters – called binary fuse filters – that are within 13 As an additional benefit, the construction of the new binary fuse filters can be more than twice as fast as the construction of xor filters. By slightly sacrificing query speed, we further reduce storage to within 8 bound. We compare the performance against a wide range of competitive alternatives such as Bloom filters, blocked Bloom filters, vector quotient filters, cuckoo filters, and the recent ribbon filters. Our experiments suggest that binary fuse filters are superior to xor filters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/17/2019

Xor Filters: Faster and Smaller Than Bloom and Cuckoo Filters

The Bloom filter provides fast approximate set membership while using li...
research
05/05/2020

Conditional Cuckoo Filters

Bloom filters, cuckoo filters, and other approximate set membership sket...
research
11/12/2021

Approximate Membership Query Filters with a False Positive Free Set

In the last decade, significant efforts have been made to reduce the fal...
research
01/06/2020

Applying Information Theory to Design Optimal Filters for Photometric Redshifts

In this paper we apply ideas from information theory to create a method ...
research
05/13/2022

Inversion of finite filters

We present a decomposition of finitely supported filters ( aka instrumen...
research
08/05/2022

Compressing (Multidimensional) Learned Bloom Filters

Bloom filters are widely used data structures that compactly represent s...
research
08/06/2019

Full-Stack Filters to Build Minimum Viable CNNs

Deep convolutional neural networks (CNNs) are usually over-parameterized...

Please sign up or login with your details

Forgot password? Click here to reset