Simple and Effective Semi-Supervised Question Answering

04/02/2018
by   Bhuwan Dhingra, et al.
0

Recent success of deep learning models for the task of extractive Question Answering (QA) is hinged on the availability of large annotated corpora. However, large domain specific annotated corpora are limited and expensive to construct. In this work, we envision a system where the end user specifies a set of base documents and only a few labelled examples. Our system exploits the document structure to create cloze-style questions from these base documents; pre-trains a powerful neural network on the cloze style questions; and further fine-tunes the model on the labeled examples. We evaluate our proposed system across three diverse datasets from different domains, and find it to be highly effective with very little labeled data. We attain more than 50 SQuAD and TriviaQA with less than a thousand labelled examples. We are also releasing a set of 3.2M cloze-style questions for practitioners to use while building QA systems.

READ FULL TEXT
research
11/30/2022

A Pipeline for Generating, Annotating and Employing Synthetic Data for Real World Question Answering

Question Answering (QA) is a growing area of research, often used to fac...
research
11/06/2019

Towards Domain Adaptation from Limited Data for Question Answering Using Deep Neural Networks

This paper explores domain adaptation for enabling question answering (Q...
research
02/06/2020

Generating Scientific Question Answering Corpora from Q A forums

Question Answering (QA) is a natural language processing task that aims ...
research
05/21/2018

Efficient and Robust Question Answering from Minimal Context over Documents

Neural models for question answering (QA) over documents have achieved s...
research
05/06/2020

Harvesting and Refining Question-Answer Pairs for Unsupervised QA

Question Answering (QA) has shown great success thanks to the availabili...
research
03/06/2020

Practical Annotation Strategies for Question Answering Datasets

Annotating datasets for question answering (QA) tasks is very costly, as...
research
11/27/2022

Improving Low-Resource Question Answering using Active Learning in Multiple Stages

Neural approaches have become very popular in the domain of Question Ans...

Please sign up or login with your details

Forgot password? Click here to reset