Explaining Black Boxes on Sequential Data using Weighted Automata

10/12/2018
by   Stéphane Ayache, et al.
0

Understanding how a learned black box works is of crucial interest for the future of Machine Learning. In this paper, we pioneer the question of the global interpretability of learned black box models that assign numerical values to symbolic sequential data. To tackle that task, we propose a spectral algorithm for the extraction of weighted automata (WA) from such black boxes. This algorithm does not require the access to a dataset or to the inner representation of the black box: the inferred model can be obtained solely by querying the black box, feeding it with inputs and analyzing its outputs. Experiments using Recurrent Neural Networks (RNN) trained on a wide collection of 48 synthetic datasets and 2 real datasets show that the obtained approximation is of great quality.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2020

Distillation of Weighted Automata from Recurrent Neural Networks using a Spectral Approach

This paper is an attempt to bridge the gap between deep learning and gra...
research
06/05/2021

Extracting Weighted Automata for Approximate Minimization in Language Modelling

In this paper we study the approximate minimization problem for language...
research
07/27/2018

Interpreting RNN behaviour via excitable network attractors

Machine learning has become a basic tool in scientific research and for ...
research
02/09/2023

Symbolic Metamodels for Interpreting Black-boxes Using Primitive Functions

One approach for interpreting black-box machine learning models is to fi...
research
08/04/2020

Making Sense of CNNs: Interpreting Deep Representations Their Invariances with INNs

To tackle increasingly complex tasks, it has become an essential ability...
research
03/27/2023

Bisimilar States in Uncertain Structures

We provide a categorical notion called uncertain bisimilarity, which all...
research
09/10/2019

Numerical integration of functions of a rapidly rotating phase

We present an algorithm for the efficient numerical evaluation of integr...

Please sign up or login with your details

Forgot password? Click here to reset