Active Learning with Label Comparisons

04/10/2022
by Gal Yona, et al.

Supervised learning typically relies on manual annotation of the true labels. When there are many potential classes, searching for the best one can be prohibitively expensive for a human annotator, whereas comparing two candidate labels is often much easier. We focus on this type of pairwise supervision and ask how it can be used effectively in learning, and in particular in active learning. In principle, finding the best of k labels can be done with k-1 active queries. We show that there is a natural class where this approach is sub-optimal, and that a more comparison-efficient active learning scheme exists. A key element in our analysis is the "label neighborhood graph" of the true distribution, which has an edge between two classes if they share a decision boundary. We also show that in the PAC setting, pairwise comparisons cannot provide improved sample complexity in the worst case. We complement our theoretical results with experiments that demonstrate the effect of the neighborhood graph on sample complexity.
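The k-1 query count mentioned in the abstract matches a simple sequential scan: keep a running winner and compare it against each remaining candidate once. The sketch below is only an illustration of that baseline, not the paper's proposed scheme. The function name `best_label`, the `prefer` oracle, and the toy `scores` table are hypothetical, and the oracle is assumed to answer consistently with a single hidden ranking.

```python
from typing import Callable, Sequence, Tuple


def best_label(
    candidates: Sequence[str],
    prefer: Callable[[str, str], bool],
) -> Tuple[str, int]:
    """Return the preferred label and the number of comparison queries used.

    `prefer(a, b)` is a hypothetical annotator oracle that returns True
    when label `a` is a better fit than label `b`. It is assumed to be
    consistent with one hidden ranking of the labels.
    """
    if not candidates:
        raise ValueError("need at least one candidate label")

    best = candidates[0]
    queries = 0
    # Each remaining candidate is compared once against the current winner,
    # so k candidates cost exactly k - 1 queries.
    for challenger in candidates[1:]:
        queries += 1
        if prefer(challenger, best):
            best = challenger
    return best, queries


# Toy stand-in for a human annotator: hidden scores, higher is better.
# These values are made up for illustration only.
scores = {"cat": 0.2, "dog": 0.5, "fox": 0.9, "owl": 0.1}
winner, used = best_label(list(scores), lambda a, b: scores[a] > scores[b])
```

With four candidate labels the scan issues three comparisons. The abstract's point is that, for a natural class of problems, an active scheme can need fewer comparisons than this per-example baseline.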

research
07/08/2019

The Power of Comparisons for Actively Learning Linear Classifiers

In the world of big data, large but costly to label datasets dominate ma...
research
07/06/2020

The Sample Complexity of Best-k Items Selection from Pairwise Comparisons

This paper studies the sample complexity (aka number of comparisons) bou...
research
10/03/2021

Active Learning for Contextual Search with Binary Feedbacks

In this paper, we study the learning problem in contextual search, which...
research
11/25/2018

HS^2: Active Learning over Hypergraphs

We propose a hypergraph-based active learning scheme which we term HS^2,...
research
03/25/2021

Active Structure Learning of Bayesian Networks in an Observational Setting

We study active structure learning of Bayesian networks in an observatio...
research
02/21/2018

Active Learning with Partial Feedback

In the large-scale multiclass setting, assigning labels often consists o...
research
07/16/2020

Active Learning under Label Shift

Distribution shift poses a challenge for active data collection in the r...
