Model Explanations with Differential Privacy

06/16/2020
by   Neel Patel, et al.
0

Black-box machine learning models are used in critical decision-making domains, giving rise to several calls for more algorithmic transparency. The drawback is that model explanations can leak information about the training data and the explanation data used to generate them, thus undermining data privacy. To address this issue, we propose differentially private algorithms to construct feature-based model explanations. We design an adaptive differentially private gradient descent algorithm, that finds the minimal privacy budget required to produce accurate explanations. It reduces the overall privacy loss on explanation data, by adaptively reusing past differentially private explanations. It also amplifies the privacy guarantees with respect to the training data. We evaluate the implications of differentially private models and our privacy mechanisms on the quality of model explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2018

Three Tools for Practical Differential Privacy

Differentially private learning on real-world data poses challenges for ...
research
12/08/2022

XRand: Differentially Private Defense against Explanation-Guided Attacks

Recent development in the field of explainable artificial intelligence (...
research
12/09/2021

Differentially Private Ensemble Classifiers for Data Streams

Learning from continuous data streams via classification/regression is p...
research
08/04/2022

Differentially Private Counterfactuals via Functional Mechanism

Counterfactual, serving as one emerging type of model explanation, has a...
research
05/15/2023

Privacy Auditing with One (1) Training Run

We propose a scheme for auditing differentially private machine learning...
research
06/05/2019

Interpretable and Differentially Private Predictions

Interpretable predictions, where it is clear why a machine learning mode...
research
08/08/2023

Accurate, Explainable, and Private Models: Providing Recourse While Minimizing Training Data Leakage

Machine learning models are increasingly utilized across impactful domai...

Please sign up or login with your details

Forgot password? Click here to reset