Two Stage Transformer Model for COVID-19 Fake News Detection and Fact Checking

by Rutvik Vijjali, et al.

The rapid advancement of online communication via social media platforms has led to a sharp rise in the spread of misinformation and fake news. Fake news is especially rampant during the COVID-19 pandemic, leading people to believe false and potentially harmful claims and stories. Detecting fake news quickly can curb the spread of panic, chaos and potential health hazards. We developed a two-stage automated pipeline for COVID-19 fake news detection using state-of-the-art machine learning models for natural language processing. The first model leverages a novel fact-checking algorithm that retrieves the most relevant facts concerning user claims about COVID-19. The second model verifies the level of truth in a claim by computing the textual entailment between the claim and the true facts retrieved from a manually curated COVID-19 dataset. The dataset is based on a publicly available knowledge source consisting of more than 5000 COVID-19 false claims and verified explanations, a subset of which was internally annotated and cross-validated to train and evaluate our models. We evaluate a series of models, ranging from those based on classical text-based features to more contextual Transformer-based models, and observe that a pipeline using BERT for the first stage and ALBERT for the second yields the best results.
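The two-stage flow described above (retrieve the most relevant facts, then test entailment between those facts and the claim) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names, the bag-of-words cosine retriever, the `k` and `threshold` parameters, and the pluggable `entails` scorer are all assumptions made for the example. In the paper's pipeline, Transformer models (BERT and ALBERT) fill the two roles.

```python
import math
import re
from collections import Counter
from typing import Callable, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve_facts(claim: str, facts: List[str], k: int = 2) -> List[Tuple[float, str]]:
    """Stage 1 (stand-in): rank facts by lexical similarity to the claim.

    Assumption: a simple bag-of-words retriever replaces the learned
    retrieval model described in the abstract.
    """
    claim_vec = Counter(tokenize(claim))
    scored = [(cosine(claim_vec, Counter(tokenize(f))), f) for f in facts]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


def verify_claim(
    claim: str,
    facts: List[str],
    entails: Callable[[str, str], float],
    k: int = 2,
    threshold: float = 0.5,
) -> bool:
    """Stage 2 (stand-in): accept the claim if any retrieved fact entails it.

    `entails(premise, hypothesis)` is a hypothetical scorer returning a
    value in [0, 1]; a fine-tuned entailment model would be plugged in here.
    """
    top = retrieve_facts(claim, facts, k)
    best = max((entails(fact, claim) for _, fact in top), default=0.0)
    return best >= threshold
```

Keeping the entailment scorer as a parameter means the retrieval stage can be checked on its own with a trivial scorer, and the stand-in can later be swapped for a real model without changing the pipeline code.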




