Streaming Bayesian Inference for Crowdsourced Classification

11/13/2019
by Edoardo Manino, et al.

A key challenge in crowdsourcing is inferring the ground truth from noisy and unreliable data. To do so, existing approaches rely on collecting redundant information from the crowd and aggregating it with some probabilistic method. However, such methods are often computationally inefficient, restricted to specific settings, or lacking in theoretical guarantees. In this paper, we revisit the problem of binary classification from crowdsourced data. Specifically, we propose Streaming Bayesian Inference for Crowdsourcing (SBIC), a new algorithm that does not suffer from any of these limitations. First, SBIC has low complexity and can be used in a real-time online setting. Second, SBIC matches the accuracy of the best state-of-the-art algorithms in all settings. Third, SBIC has provable asymptotic guarantees in both the online and offline settings.
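
To make the idea of aggregating redundant labels one at a time more concrete, here is a minimal sketch of streaming Bayesian aggregation for a single binary task. It is not the SBIC algorithm, whose details are not given in this abstract. It assumes, purely for illustration, that each worker's accuracy is known in advance and that votes are conditionally independent given the true label. The function names `update_log_odds` and `aggregate` are hypothetical.

```python
import math


def update_log_odds(log_odds, label, accuracy):
    """Apply one Bayes update to the log-odds that the true class is +1.

    label:    the worker's vote, either +1 or -1.
    accuracy: assumed-known probability that the worker votes correctly,
              strictly between 0 and 1 (an assumption of this sketch).
    """
    if label not in (1, -1):
        raise ValueError("label must be +1 or -1")
    if not 0.0 < accuracy < 1.0:
        raise ValueError("accuracy must lie strictly between 0 and 1")
    # A vote shifts the log-odds by the worker's log-likelihood ratio,
    # in the direction of the vote.
    return log_odds + label * math.log(accuracy / (1.0 - accuracy))


def aggregate(votes, prior_positive=0.5):
    """Process (label, accuracy) pairs in arrival order.

    Returns the predicted class (+1 or -1) and the posterior
    probability that the true class is +1.
    """
    log_odds = math.log(prior_positive / (1.0 - prior_positive))
    for label, accuracy in votes:
        # Constant work per vote, so the estimate is available online.
        log_odds = update_log_odds(log_odds, label, accuracy)
    posterior_positive = 1.0 / (1.0 + math.exp(-log_odds))
    prediction = 1 if log_odds >= 0.0 else -1
    return prediction, posterior_positive
```

For example, with one +1 vote from a worker of accuracy 0.9 and two -1 votes from workers of accuracy 0.6, the posterior odds for the positive class are 9 × (2/3)² = 4, so `aggregate([(1, 0.9), (-1, 0.6), (-1, 0.6)])` predicts +1 with posterior probability 0.8, whereas a simple majority vote would pick -1.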

research
06/01/2016

A Minimax Optimal Algorithm for Crowdsourcing

We consider the problem of accurately estimating the reliability of work...
research
04/30/2013

Inferring ground truth from multi-annotator ordinal data: a probabilistic approach

A popular approach for large scale data annotation tasks is crowdsourcin...
research
01/26/2012

Dynamic trees for streaming and massive data contexts

Data collection at a massive scale is becoming ubiquitous in a wide vari...
research
04/08/2019

A Generalization Bound for Online Variational Inference

Bayesian inference provides an attractive online-learning framework to a...
research
08/31/2017

Identifying Unsafe Videos on Online Public Media using Real-time Crowdsourcing

Due to the significant growth of social networking and human activities ...
research
07/18/2014

Bayesian Nonparametric Crowdsourcing

Crowdsourcing has been proven to be an effective and efficient tool to a...
research
06/21/2021

Scalable Bayesian inference for time series via divide-and-conquer

Bayesian computational algorithms tend to scale poorly as data size incr...
