To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering

12/20/2022
by   Dheeru Dua, et al.
0

Recent advances in open-domain question answering (ODQA) have demonstrated impressive accuracy on standard Wikipedia style benchmarks. However, it is less clear how robust these models are and how well they perform when applied to real-world applications in drastically different domains. While there has been some work investigating how well ODQA models perform when tested for out-of-domain (OOD) generalization, these studies have been conducted only under conservative shifts in data distribution and typically focus on a single component (ie. retrieval) rather than an end-to-end system. In response, we propose a more realistic and challenging domain shift evaluation setting and, through extensive experiments, study end-to-end model performance. We find that not only do models fail to generalize, but high retrieval scores often still yield poor answer prediction accuracy. We then categorize different types of shifts and propose techniques that, when presented with a new dataset, predict if intervention methods are likely to be successful. Finally, using insights from this analysis, we propose and evaluate several intervention methods which improve end-to-end answer F1 score by up to 24 points.

READ FULL TEXT

page 7

page 8

research
02/09/2023

Robust Question Answering against Distribution Shifts with Test-Time Adaptation: An Empirical Study

A deployed question answering (QA) model can easily fail when the test d...
research
02/05/2019

End-to-End Open-Domain Question Answering with BERTserini

We demonstrate an end-to-end question answering system that integrates B...
research
04/15/2022

Improving Passage Retrieval with Zero-Shot Question Generation

We propose a simple and effective re-ranking method for improving passag...
research
01/06/2018

Analysis of Wikipedia-based Corpora for Question Answering

This paper gives comprehensive analyses of corpora based on Wikipedia fo...
research
01/02/2021

End-to-End Training of Neural Retrievers for Open-Domain Question Answering

Recent work on training neural retrievers for open-domain question answe...
research
03/29/2021

Domain-robust VQA with diverse datasets and methods but no target labels

The observation that computer vision methods overfit to dataset specific...
research
05/02/2019

Conditioning LSTM Decoder and Bi-directional Attention Based Question Answering System

Applying neural-networks on Question Answering has gained increasing pop...

Please sign up or login with your details

Forgot password? Click here to reset