Probabilistic Dataset Reconstruction from Interpretable Models

08/29/2023
by   Julien Ferry, et al.
0

Interpretability is often pointed out as a key requirement for trustworthy machine learning. However, learning and releasing models that are inherently interpretable leaks information regarding the underlying training data. As such disclosure may directly conflict with privacy, a precise quantification of the privacy impact of such breach is a fundamental problem. For instance, previous work have shown that the structure of a decision tree can be leveraged to build a probabilistic reconstruction of its training dataset, with the uncertainty of the reconstruction being a relevant metric for the information leak. In this paper, we propose of a novel framework generalizing these probabilistic reconstructions in the sense that it can handle other forms of interpretable models and more generic types of knowledge. In addition, we demonstrate that under realistic assumptions regarding the interpretable models' structure, the uncertainty of the reconstruction can be computed efficiently. Finally, we illustrate the applicability of our approach on both decision trees and rule lists, by comparing the theoretical information leak associated to either exact or heuristic learning algorithms. Our results suggest that optimal interpretable models are often more compact and leak less information regarding their training data than greedily-built ones, for a given accuracy level.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2019

Interpretable multiclass classification by MDL-based rule lists

Interpretable classifiers have recently witnessed an increase in attenti...
research
02/27/2016

Scalable Bayesian Rule Lists

We present an algorithm for building probabilistic rule lists that is tw...
research
06/10/2022

Explaining Neural Networks without Access to Training Data

We consider generating explanations for neural networks in cases where t...
research
01/30/2023

Optimal Decision Tree Policies for Markov Decision Processes

Interpretability of reinforcement learning policies is essential for man...
research
06/17/2019

Learning Interpretable Models Using an Oracle

As Machine Learning (ML) becomes pervasive in various real world systems...
research
04/16/2023

Assisting clinical practice with fuzzy probabilistic decision trees

The need for fully human-understandable models is increasingly being rec...

Please sign up or login with your details

Forgot password? Click here to reset