Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms

11/11/2022
by   Vashist Avadhanula, et al.
0

We describe the current content moderation strategy employed by Meta to remove policy-violating content from its platforms. Meta relies on both handcrafted and learned risk models to flag potentially violating content for human review. Our approach aggregates these risk models into a single ranking score, calibrating them to prioritize more reliable risk models. A key challenge is that violation trends change over time, affecting which risk models are most reliable. Our system additionally handles production challenges such as changing risk models and novel risk models. We use a contextual bandit to update the calibration in response to such trends. Our approach increases Meta's top-line metric for measuring the effectiveness of its content moderation strategy by 13

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2023

Statistical Methods for Auditing the Quality of Manual Content Reviews

Large technology firms face the problem of moderating content on their o...
research
10/21/2020

Top 5 Content Marketing Trends for 2020

As content #marketing continues to be an indispensable element of an org...
research
08/18/2019

Modeling Islamist Extremist Communications on Social Media using Contextual Dimensions: Religion, Ideology, and Hate

Terror attacks have been linked in part to online extremist content. Alt...
research
08/16/2022

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

Social media platforms struggle to protect users from harmful content th...
research
10/31/2022

Listen to what they say: Better understand and detect online misinformation with user feedback

Social media users who report content are key allies in the management o...
research
05/10/2021

Not All Relevance Scores are Equal: Efficient Uncertainty and Calibration Modeling for Deep Retrieval Models

In any ranking system, the retrieval model outputs a single score for a ...
research
11/10/2021

A Meta-Method for Portfolio Management Using Machine Learning for Adaptive Strategy Selection

This work proposes a novel portfolio management technique, the Meta Port...

Please sign up or login with your details

Forgot password? Click here to reset