Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions

09/07/2018
by   Eric Wallace, et al.
0

Modern natural language processing systems have been touted as approaching human performance. However, existing datasets are imperfect tests. Examples are written with humans in mind, not computers, and often do not properly expose model limitations. We address this by developing a new process for crowdsourced annotation, adversarial writing, where humans interact with trained models and try to break them. Applying this annotation process to Trivia question answering yields a challenge set, which despite being easy for human players to answer, systematically stumps automated question answering systems. Diagnosing model errors on the evaluation data provides actionable insights to explore in developing more robust and generalizable question answering systems.

READ FULL TEXT

page 2

page 6

research
05/08/2022

Chart Question Answering: State of the Art and Future Directions

Information visualizations such as bar charts and line charts are very c...
research
05/19/2022

Modeling Exemplification in Long-form Question Answering via Retrieval

Exemplification is a process by which writers explain or clarify a conce...
research
12/13/2022

Despite "super-human" performance, current LLMs are unsuited for decisions about ethics and safety

Large language models (LLMs) have exploded in popularity in the past few...
research
04/09/2020

Natural Perturbation for Robust Question Answering

While recent models have achieved human-level scores on many NLP dataset...
research
04/18/2021

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation

Despite the availability of very large datasets and pretrained models, s...
research
02/01/2021

Can Small and Synthetic Benchmarks Drive Modeling Innovation? A Retrospective Study of Question Answering Modeling Approaches

Datasets are not only resources for training accurate, deployable system...

Please sign up or login with your details

Forgot password? Click here to reset