Interventional Probing in High Dimensions: An NLI Case Study

04/20/2023
by Julia Rozanova, et al.

Probing strategies have been shown to detect the presence of various linguistic features in large language models, in particular semantic features intermediate to the "natural logic" fragment of the Natural Language Inference (NLI) task. In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known. This provides a ripe setting for interventional studies on NLI models' representations, allowing for stronger causal conjectures and a deeper critical analysis of interventional probing methods. In this work, we carry out new and existing representation-level interventions to investigate the effect of these semantic features on NLI classification: we perform amnesic probing (which removes features as directed by learned linear probes) and introduce the mnestic probing variation (which forgets all dimensions except the probe-selected ones). Furthermore, we delve into the limitations of these methods and outline some pitfalls that have been obscuring the effectiveness of interventional probing studies.
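The contrast between the two interventions can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a linear probe with weight matrix `W` is already available and shows a single projection step, whereas amnesic probing in the literature is typically applied iteratively over several retrained probes. The function names, dimensions, and random data are all hypothetical.

```python
import numpy as np


def rowspace_basis(W, tol=1e-10):
    """Orthonormal basis (as rows) for the row space of probe weights W (k x d)."""
    _, s, vt = np.linalg.svd(W, full_matrices=False)
    return vt[s > tol]


def amnesic_projection(W):
    """Projection onto the nullspace of W: removes the probe-selected directions."""
    B = rowspace_basis(W)
    return np.eye(W.shape[1]) - B.T @ B


def mnestic_projection(W):
    """Projection onto the row space of W: keeps only the probe-selected directions."""
    B = rowspace_basis(W)
    return B.T @ B


# Hypothetical stand-ins: a real study would take W from a trained linear
# probe and X from the NLI model's hidden representations.
rng = np.random.default_rng(0)
d, k = 16, 3
W = rng.normal(size=(k, d))  # assumed probe weights (k classes, d dimensions)
X = rng.normal(size=(5, d))  # assumed representations (5 examples)

# Both projection matrices are symmetric, so right-multiplication applies them row-wise.
X_amnesic = X @ amnesic_projection(W)
X_mnestic = X @ mnestic_projection(W)
```

Under these assumptions the two interventions are complementary: the probe's logits vanish on `X_amnesic`, are unchanged on `X_mnestic`, and the two projected representations sum back to `X`.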


research
05/15/2023

Estimating the Causal Effects of Natural Logic Features in Neural NLI Models

Rigorous evaluation of the causal effects of semantic features on langua...
research
12/15/2021

Decomposing Natural Logic Inferences in Neural NLI

In the interest of interpreting neural NLI models and their reasoning st...
research
03/01/2023

Competence-Based Analysis of Language Models

Despite the recent success of large pretrained language models (LMs) on ...
research
07/06/2023

NatLogAttack: A Framework for Attacking Natural Language Inference Models with Natural Logic

Reasoning has been a central topic in artificial intelligence from the b...
research
02/20/2018

On the scaling of polynomial features for representation matching

In many neural models, new features as polynomial functions of existing ...
research
04/12/2021

Does My Representation Capture X? Probe-Ably

Probing (or diagnostic classification) has become a popular strategy for...
research
08/28/2018

Semantic Matching Against a Corpus: New Applications and Methods

We consider the case of a domain expert who wishes to explore the extent...
