Multi-Mention Learning for Reading Comprehension with Neural Cascades

11/02/2017
by   Swabha Swayamdipta, et al.
0

Reading comprehension is a challenging task, especially when executed across longer or across multiple evidence documents, where the answer is likely to reoccur. Existing neural architectures typically do not scale to the entire evidence, and hence, resort to selecting a single passage in the document (either via truncation or other means), and carefully searching for the answer within that passage. However, in some cases, this strategy can be suboptimal, since by focusing on a specific passage, it becomes difficult to leverage multiple mentions of the same answer throughout the document. In this work, we take a different approach by constructing lightweight models that are combined in a cascade to find the answer. Each submodel consists only of feed-forward networks equipped with an attention mechanism, making it trivially parallelizable. We show that our approach can scale to approximately an order of magnitude larger evidence documents and can aggregate information from multiple mentions of each answer candidate across the document. Empirically, our approach achieves state-of-the-art performance on both the Wikipedia and web domains of the TriviaQA dataset, outperforming more complex, recurrent architectures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2018

A Co-Matching Model for Multi-choice Reading Comprehension

Multi-choice reading comprehension is a challenging task, which involves...
research
05/06/2018

Multi-Passage Machine Reading Comprehension with Cross-Passage Answer Verification

Machine reading comprehension (MRC) on real web data usually requires th...
research
10/31/2016

End-to-End Answer Chunk Extraction and Ranking for Reading Comprehension

This paper proposes dynamic chunk reader (DCR), an end-to-end neural rea...
research
05/16/2018

Joint Training of Candidate Extraction and Answer Selection for Reading Comprehension

While sophisticated neural-based techniques have been developed in readi...
research
12/05/2018

Weighted Global Normalization for Multiple Choice ReadingComprehension over Long Documents

Motivated by recent evidence pointing out the fragility of high-performi...
research
11/28/2018

A Deep Cascade Model for Multi-Document Reading Comprehension

A fundamental trade-off between effectiveness and efficiency needs to be...
research
10/17/2017

Constructing Datasets for Multi-hop Reading Comprehension Across Documents

Most Reading Comprehension methods limit themselves to queries which can...

Please sign up or login with your details

Forgot password? Click here to reset