AraStance: A Multi-Country and Multi-Domain Dataset of Arabic Stance Detection for Fact Checking

04/28/2021
by   Tariq Alhindi, et al.
6

With the continuing spread of misinformation and disinformation online, it is of increasing importance to develop combating mechanisms at scale in the form of automated systems that support multiple languages. One task of interest is claim veracity prediction, which can be addressed using stance detection with respect to relevant documents retrieved online. To this end, we present our new Arabic Stance Detection dataset (AraStance) of 910 claims from a diverse set of sources comprising three fact-checking websites and one news website. AraStance covers false and true claims from multiple domains (e.g., politics, sports, health) and several Arab countries, and it is wellbalanced between related and unrelated documents with respect to the claims. We benchmark AraStance, along with two other stance detection datasets, using a number of BERTbased models. Our best model achieves an accuracy of 85 leaves room for improvement and reflects the challenging nature of AraStance and the task of stance detection in general.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2020

ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection

In this paper we introduce ArCOV19-Rumors, an Arabic COVID-19 Twitter da...
research
06/05/2023

Early Rumor Detection Using Neural Hawkes Process with a New Benchmark Dataset

Little attention has been paid on EArly Rumor Detection (EARD), and EARD...
research
09/09/2022

PoxVerifi: An Information Verification System to Combat Monkeypox Misinformation

Following recent outbreaks, monkeypox-related misinformation continues t...
research
04/21/2018

Integrating Stance Detection and Fact Checking in a Unified Corpus

A reasonable approach for fact checking a claim involves retrieving pote...
research
06/07/2019

FAKTA: An Automatic End-to-End Fact Checking System

We present FAKTA which is a unified framework that integrates various co...
research
09/07/2019

MultiFC: A Real-World Multi-Domain Dataset for Evidence-Based Fact Checking of Claims

We contribute the largest publicly available dataset of naturally occurr...
research
01/26/2022

CsFEVER and CTKFacts: Czech Datasets for Fact Verification

In this paper, we present two Czech datasets for automated fact-checking...

Please sign up or login with your details

Forgot password? Click here to reset