Locally Differentially Private Data Collection and Analysis

by   Teng Wang, et al.

Local differential privacy (LDP) can provide each user with strong privacy guarantees under untrusted data curators while ensuring accurate statistics derived from privatized data. Due to its powerfulness, LDP has been widely adopted to protect privacy in various tasks (e.g., heavy hitters discovery, probability estimation) and systems (e.g., Google Chrome, Apple iOS). Although ϵ-LDP has been proposed for many years, the more general notion of (ϵ, δ)-LDP has only been studied in very few papers, which mainly consider mean estimation for numeric data. Besides, prior solutions achieve (ϵ, δ)-LDP by leveraging Gaussian mechanism, which leads to low accuracy of the aggregated results. In this paper, we propose novel mechanisms that achieve (ϵ, δ)-LDP with high utility in data analytics and machine learning. Specifically, we first design (ϵ, δ)-LDP algorithms for collecting multi-dimensional numeric data, which can ensure higher accuracy than the optimal Gaussian mechanism while guaranteeing strong privacy for each user. Then, we investigate different local protocols for categorical attributes under (ϵ, δ)-LDP. Furthermore, we conduct theoretical analysis on the error bound and variance of the proposed algorithms. Experimental results on real and synthetic datasets demonstrate the high data utility of our proposed algorithms on both simple data statistics and complex machine learning models.


PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility

Data collection under local differential privacy (LDP) has been mostly s...

Collecting Telemetry Data Privately

The collection and analysis of telemetry data from users' devices is rou...

Subset Privacy: Draw from an Obfuscated Urn

With the rapidly increasing ability to collect and analyze personal data...

A Comprehensive Survey on Local Differential Privacy Toward Data Statistics and Analysis in Crowdsensing

Collecting and analyzing massive data generated from smart devices have ...

Utility Analysis and Enhancement of LDP Mechanisms in High-Dimensional Space

Local differential privacy (LDP), which perturbs the data of each user l...

Calibrate: Frequency Estimation and Heavy Hitter Identification with Local Differential Privacy via Incorporating Prior Knowledge

Estimating frequencies of certain items among a population is a basic st...

Task-aware Privacy Preservation for Multi-dimensional Data

Local differential privacy (LDP), a state-of-the-art technique for priva...

Please sign up or login with your details

Forgot password? Click here to reset