NorQuAD: Norwegian Question Answering Dataset

05/03/2023
by   Sardana Ivanova, et al.
0

In this paper we present NorQuAD: the first Norwegian question answering dataset for machine reading comprehension. The dataset consists of 4,752 manually created question-answer pairs. We here detail the data collection procedure and present statistics of the dataset. We also benchmark several multilingual and Norwegian monolingual language models on the dataset and compare them against human performance. The dataset will be made freely available.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset