Every major technical invention resurfaces the dual-use dilemma – the ne...
Previous works show that deep NLP models are not always conceptually sou...
This paper studies faithful explanations for Graph Neural Networks (GNNs...
Counterfactual examples are one of the most commonly-cited methods for
e...
Recent work on explaining Deep Neural Networks (DNNs) focuses on attribu...
While "attention is all you need" may be proving true, we do not yet kno...
While Deep Neural Networks (DNNs) are becoming the state-of-the-art for ...
With the growing use of AI in highly consequential domains, the
quantifi...
Feature attributions are a popular tool for explaining the behavior of D...
LSTM-based recurrent neural networks are the state-of-the-art for many
n...
Attribution methods that explains the behaviour of machine learning mode...
We study the phenomenon of bias amplification in classifiers, wherein a
...
A machine learning model may exhibit discrimination when used to make
de...
Privacy and nondiscrimination are related but different. We make this
ob...
We examine whether neural natural language processing (NLP) systems refl...
Causal influence measures for machine learnt classifiers shed light on t...
We study the problem of explaining a rich class of behavioral properties...
Many machine learning systems utilize latent factors as internal
represe...
In this report, we applied integrated gradients to explaining a neural
n...