Differentially Private Bootstrap: New Privacy Analysis and Inference Strategies

10/12/2022
by   Zhanyu Wang, et al.
0

Differential private (DP) mechanisms protect individual-level information by introducing randomness into the statistical analysis procedure. While there are now many DP tools for various statistical problems, there is still a lack of general techniques to understand the sampling distribution of a DP estimator, which is crucial for uncertainty quantification in statistical inference. We analyze a DP bootstrap procedure that releases multiple private bootstrap estimates to infer the sampling distribution and construct confidence intervals. Our privacy analysis includes new results on the privacy cost of a single DP bootstrap estimate applicable to incorporate arbitrary DP mechanisms and identifies some misuses of the bootstrap in the existing literature. We show that the release of B DP bootstrap estimates from mechanisms satisfying (μ/√((2-2/e)B))-Gaussian DP asymptotically satisfies μ-Gaussian DP as B goes to infinity. We also develop a statistical procedure based on the DP bootstrap estimates to correctly infer the sampling distribution using techniques related to the deconvolution of probability measures, an approach which is novel in analyzing DP procedures. From our density estimate, we construct confidence intervals and compare them to existing methods through simulations and real-world experiments using the 2016 Canada Census Public Use Microdata. The coverage of our private confidence intervals achieves the nominal confidence level, while other methods fail to meet this guarantee.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2022

Noise-Aware Statistical Inference with Differentially Private Synthetic Data

While generation of synthetic data under differential privacy (DP) has r...
research
10/28/2021

Privacy Preserving Inference on the Ratio of Two Gaussians Using (Weighted) Sums

The ratio of two Gaussians is useful in many contexts of statistical inf...
research
06/11/2023

Fast, Distribution-free Predictive Inference for Neural Networks with Coverage Guarantees

This paper introduces a novel, computationally-efficient algorithm for p...
research
10/22/2021

A Feasibility Study of Differentially Private Summary Statistics and Regression Analyses for Administrative Tax Data

Federal administrative tax data are invaluable for research, but because...
research
05/18/2018

Method G: Uncertainty Quantification for Distributed Data Problems using Generalized Fiducial Inference

It is not unusual for a data analyst to encounter data sets distributed ...
research
03/09/2023

Simulation-based, Finite-sample Inference for Privatized Data

Privacy protection methods, such as differentially private mechanisms, i...
research
06/27/2022

Network resampling for estimating uncertainty

With network data becoming ubiquitous in many applications, many models ...

Please sign up or login with your details

Forgot password? Click here to reset