Detecting Fake News with Weak Social Supervision

10/24/2019
by   Kai Shu, et al.
0

Limited labeled data is becoming the largest bottleneck for supervised learning systems. This is especially the case for many real-world tasks where large scale annotated examples are either too expensive to acquire or unavailable due to privacy or data access constraints. Weak supervision has shown to be a good means to mitigate the scarcity of annotated data by leveraging weak labels or injecting constraints from heuristic rules and/or external knowledge sources. Social media has little labeled data but possesses unique characteristics that make it suitable for generating weak supervision, resulting in a new type of weak supervision, i.e., weak social supervision. In this article, we illustrate how various aspects of social media can be used to generate weak social supervision. Specifically, we use the recent research on fake news detection as the use case, where social engagements are abundant but annotated examples are scarce, to show that weak social supervision is effective when facing the little labeled data problem. This article opens the door for learning with weak social supervision for other emerging tasks.

READ FULL TEXT
research
12/28/2019

Weak Supervision for Fake News Detection via Reinforcement Learning

Today social media has become the primary source for news. Via social me...
research
05/26/2020

Learning with Weak Supervision for Email Intent Detection

Email remains one of the most frequently used means of online communicat...
research
02/24/2022

Construction of Large-Scale Misinformation Labeled Datasets from Social Media Discourse using Label Refinement

Malicious accounts spreading misinformation has led to widespread false ...
research
06/10/2019

Deep Two-path Semi-supervised Learning for Fake News Detection

News in social media such as Twitter has been generated in high volume a...
research
01/31/2020

Two-path Deep Semi-supervised Learning for Timely Fake News Detection

News in social media such as Twitter has been generated in high volume a...
research
06/10/2022

Label Noise-Resistant Mean Teaching for Weakly Supervised Fake News Detection

Fake news spreads at an unprecedented speed, reaches global audiences an...
research
12/02/2018

Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

Labeling training data is one of the most costly bottlenecks in developi...

Please sign up or login with your details

Forgot password? Click here to reset