Answering Multi-Dimensional Range Queries under Local Differential Privacy

by   Jianyu Yang, et al.

In this paper, we tackle the problem of answering multi-dimensional range queries under local differential privacy. There are three key technical challenges: capturing the correlations among attributes, avoiding the curse of dimensionality, and dealing with the large domains of attributes. None of the existing approaches satisfactorily deals with all three challenges. Overcoming these three challenges, we first propose an approach called Two-Dimensional Grids (TDG). Its main idea is to carefully use binning to partition the two-dimensional (2-D) domains of all attribute pairs into 2-D grids that can answer all 2-D range queries and then estimate the answer of a higher dimensional range query from the answers of the associated 2-D range queries. However, in order to reduce errors due to noises, coarse granularities are needed for each attribute in 2-D grids, losing fine-grained distribution information for individual attributes. To correct this deficiency, we further propose Hybrid-Dimensional Grids (HDG), which also introduces 1-D grids to capture finer-grained information on distribution of each individual attribute and combines information from 1-D and 2-D grids to answer range queries. To make HDG consistently effective, we provide a guideline for properly choosing granularities of grids based on an analysis of how different sources of errors are impacted by these choices. Extensive experiments conducted on real and synthetic datasets show that HDG can give a significant improvement over the existing approaches.


page 15

page 22


Answering Summation Queries for Numerical Attributes under Differential Privacy

In this work we explore the problem of answering a set of sum queries un...

Cortex: Harnessing Correlations to Boost Query Performance

Databases employ indexes to filter out irrelevant records, which reduces...

Castell: Scalable Joint Probability Estimation of Multi-dimensional Data Randomized with Local Differential Privacy

Performing randomized response (RR) over multi-dimensional data is subje...

Multi-Dimensional Randomized Response

In our data world, a host of not necessarily trusted controllers gather ...

A Differentially Private Algorithm for Range Queries on Trajectories

We propose a novel algorithm to ensure ϵ-differential privacy for answer...

Making the Most of Parallel Composition in Differential Privacy

We show that the `optimal' use of the parallel composition theorem corre...

Z-ordered Range Refinement for Multi-dimensional Range Queries

The z-order curve is a space-filling curve and is now attracting the int...

Please sign up or login with your details

Forgot password? Click here to reset