A Chinese Multi-type Complex Questions Answering Dataset over Wikidata

11/11/2021
by Jianyun Zou, et al.

Complex Knowledge Base Question Answering (KBQA) has been a popular area of research over the past decade. Recent public datasets have led to encouraging results in this field, but they are mostly limited to English and involve only a small number of question types and relations, which hinders research in more realistic settings and in languages other than English. In addition, few state-of-the-art KBQA models are trained on Wikidata, one of the most popular real-world knowledge bases. To address these challenges, we propose CLC-QuAD, the first large-scale complex Chinese semantic parsing dataset over Wikidata. Together with the dataset, we present a text-to-SPARQL baseline model that can effectively answer multi-type complex questions, such as factual, dual-intent, boolean, and counting questions, with Wikidata as the background knowledge. Finally, we analyze the performance of state-of-the-art KBQA models on this dataset and identify the challenges facing Chinese KBQA.
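To make the four question types concrete, the sketch below pairs each with a plausible SPARQL shape over Wikidata. The queries are illustrative and are not taken from CLC-QuAD. It assumes that a "dual intent" question asks for two pieces of information at once. The names `EXAMPLE_QUERIES` and `query_shape` are hypothetical helpers, not part of the authors' baseline.

```python
import re

# Hypothetical example queries (not from the dataset), using the usual
# Wikidata prefixes: wd: for entities, wdt: for direct properties.
# Q142 = France, Q90 = Paris, P36 = capital, P1082 = population,
# P47 = shares border with.
EXAMPLE_QUERIES = {
    # "What is the capital of France?"
    "factual": "SELECT ?capital WHERE { wd:Q142 wdt:P36 ?capital . }",
    # "What are the capital and the population of France?"
    "dual_intent": (
        "SELECT ?capital ?population WHERE "
        "{ wd:Q142 wdt:P36 ?capital ; wdt:P1082 ?population . }"
    ),
    # "Is Paris the capital of France?"
    "boolean": "ASK { wd:Q142 wdt:P36 wd:Q90 . }",
    # "How many countries border France?"
    "counting": (
        "SELECT (COUNT(?country) AS ?n) WHERE "
        "{ wd:Q142 wdt:P47 ?country . }"
    ),
}


def query_shape(sparql: str) -> str:
    """Guess the question type from the query head (text before the first '{').

    This is a rough heuristic for illustration only; real queries can
    combine these shapes.
    """
    head = sparql.split("{", 1)[0]
    if head.strip().upper().startswith("ASK"):
        return "boolean"
    if re.search(r"\bCOUNT\s*\(", head, re.IGNORECASE):
        return "counting"
    projected = set(re.findall(r"\?\w+", head))
    return "dual_intent" if len(projected) > 1 else "factual"


if __name__ == "__main__":
    for label, query in EXAMPLE_QUERIES.items():
        print(label, "->", query_shape(query))
```

The point of the sketch is only that the answer type is visible in the query form: `ASK` yields a boolean, a `COUNT` aggregate yields a number, and the number of projected variables separates single-answer from dual-intent questions.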


