ReSCo-CC: Unsupervised Identification of Key Disinformation Sentences

10/21/2020
by   Soumya Suvra Ghosal, et al.
0

Disinformation is often presented in long textual articles, especially when it relates to domains such as health, often seen in relation to COVID-19. These articles are typically observed to have a number of trustworthy sentences among which core disinformation sentences are scattered. In this paper, we propose a novel unsupervised task of identifying sentences containing key disinformation within a document that is known to be untrustworthy. We design a three-phase statistical NLP solution for the task which starts with embedding sentences within a bespoke feature space designed for the task. Sentences represented using those features are then clustered, following which the key sentences are identified through proximity scoring. We also curate a new dataset with sentence level disinformation scorings to aid evaluation for this task; the dataset is being made publicly available to facilitate further research. Based on a comprehensive empirical evaluation against techniques from related tasks such as claim detection and summarization, as well as against simplified variants of our proposed approach, we illustrate that our method is able to identify core disinformation effectively.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/21/2022

SciBERTSUM: Extractive Summarization for Scientific Documents

The summarization literature focuses on the summarization of news articl...
research
11/03/2018

Unsupervised Identification of Study Descriptors in Toxicology Research: An Experimental Study

Identifying and extracting data elements such as study descriptors in pu...
research
09/06/2019

Features in Extractive Supervised Single-document Summarization: Case of Persian News

Text summarization has been one of the most challenging areas of researc...
research
08/05/2017

Extractive Multi Document Summarization using Dynamical Measurements of Complex Networks

Due to the large amount of textual information available on Internet, it...
research
03/18/2016

Readability-based Sentence Ranking for Evaluating Text Simplification

We propose a new method for evaluating the readability of simplified sen...
research
04/18/2021

On the Use of Context for Predicting Citation Worthiness of Sentences in Scholarly Articles

In this paper, we study the importance of context in predicting the cita...
research
08/25/2020

Extractive Summarizer for Scholarly Articles

We introduce an extractive method that will summarize long scientific pa...

Please sign up or login with your details

Forgot password? Click here to reset