Right Answer for the Wrong Reason: Discovery and Mitigation

04/20/2018
by Shi Feng, et al.

Exposing the weaknesses of neural models is crucial for improving their performance and robustness in real-world applications. One common approach is to examine how input perturbations affect the output. Our analysis takes this to an extreme on natural language processing tasks by removing as many words as possible from the input without changing the model prediction. For question answering and natural language inference, this often reduces the inputs to just one or two words, while model confidence remains largely unchanged. This is undesirable behavior: the model gets the Right Answer for the Wrong Reason (RAWR). We introduce a simple training technique that mitigates this problem while maintaining performance on regular examples.
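The word-removal analysis described in the abstract can be illustrated with a small sketch. The abstract does not say how the words to remove are chosen, so the greedy leave-one-out strategy below is an assumption, and `toy_predict`, `top_label`, and `reduce_input` are hypothetical names introduced only for illustration. The toy model stands in for a real neural classifier.

```python
from typing import Callable, Dict, List, Tuple

# A predictor maps a list of words to a probability for each label.
Predict = Callable[[List[str]], Dict[str, float]]


def toy_predict(words: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in model: 'positive' whenever 'good' is present."""
    if "good" in words:
        return {"positive": 0.9, "negative": 0.1}
    return {"positive": 0.2, "negative": 0.8}


def top_label(probs: Dict[str, float]) -> str:
    """Return the label with the highest probability."""
    return max(probs, key=probs.get)


def reduce_input(words: List[str], predict: Predict) -> Tuple[List[str], str]:
    """Greedily drop words while the predicted label stays unchanged.

    Assumed strategy: at each step, try removing each remaining word and
    keep the removal that leaves the original label with the highest
    confidence. Stop when one word is left or every removal flips the label.
    """
    original = top_label(predict(words))
    current = list(words)
    while len(current) > 1:
        best, best_conf = None, -1.0
        for i in range(len(current)):
            candidate = current[:i] + current[i + 1:]
            probs = predict(candidate)
            if top_label(probs) != original:
                continue  # this removal would change the prediction
            if probs[original] > best_conf:
                best, best_conf = candidate, probs[original]
        if best is None:
            break  # no word can be removed without changing the prediction
        current = best
    return current, original


if __name__ == "__main__":
    reduced, label = reduce_input("the movie was really good".split(), toy_predict)
    print(reduced, label)
```

With this toy model the five-word input shrinks to a single word while the label is preserved, which mirrors in miniature the one-or-two-word reductions the abstract reports for real models.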

research
12/10/2021

Improving the Question Answering Quality using Answer Candidate Filtering based on Natural-Language Features

Software with natural-language user interfaces has an ever-increasing im...
research
04/18/2021

Can NLI Models Verify QA Systems' Predictions?

To build robust question answering systems, we need the ability to verif...
research
09/01/2019

Incidental Supervision from Question-Answering Signals

Human annotations are costly for many natural language processing (NLP) ...
research
11/10/2022

Understanding Text Classification Data and Models Using Aggregated Input Salience

Realizing when a model is right for a wrong reason is not trivial and re...
research
05/05/2015

Ask Your Neurons: A Neural-based Approach to Answering Questions about Images

We address a question answering task on real-world images that is set up...
research
10/01/2014

A Multi-World Approach to Question Answering about Real-World Scenes based on Uncertain Input

We propose a method for automatically answering questions about images b...
research
09/17/2020

On the Transferability of Minimal Prediction Preserving Inputs in Question Answering

Recent work (Feng et al., 2018) establishes the presence of short, unint...
