Understanding Difficulty-based Sample Weighting with a Universal Difficulty Measure

01/12/2023
by Xiaoling Zhou, et al.

Sample weighting is widely used in deep learning. Many weighting methods use the learning difficulty of training samples to calculate their weights. In this study, this scheme is called difficulty-based weighting. Two important issues arise when explaining this scheme. First, no unified difficulty measure with theoretical guarantees exists for training samples. The learning difficulty of a sample is determined by multiple factors, including noise level, imbalance degree, margin, and uncertainty. Nevertheless, existing measures consider only a single factor or a subset of these factors, not all of them together. Second, a comprehensive theoretical explanation of why difficulty-based weighting schemes are effective in deep learning is lacking. In this study, we theoretically prove that the generalization error of a sample can be used as a universal difficulty measure. Furthermore, we provide formal theoretical justifications for the role of difficulty-based weighting in deep learning, revealing its positive influence on both the optimization dynamics and the generalization performance of deep models, which is instructive for existing weighting schemes.
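To make the idea of difficulty-based weighting concrete, here is a minimal sketch in plain Python. It is not the authors' method. It assumes the per-sample training loss as a stand-in for difficulty, whereas the abstract argues for the generalization error of a sample, which is not directly observable during training. The function names, the softmax mapping, and the `temperature` and `favor_hard` parameters are all hypothetical choices made for illustration.

```python
import math


def difficulty_weights(losses, temperature=1.0, favor_hard=True):
    """Map per-sample difficulty scores to weights whose mean is 1.

    Assumption: each entry of `losses` is a proxy for how hard the
    sample is to learn (larger means harder).
    """
    sign = 1.0 if favor_hard else -1.0
    scaled = [sign * loss / temperature for loss in losses]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    n = len(losses)
    # Softmax weights sum to 1; multiply by n so the mean weight is 1.
    return [n * e / total for e in exps]


def weighted_mean_loss(losses, weights):
    """Average of the per-sample losses after applying the weights."""
    return sum(w * loss for w, loss in zip(weights, losses)) / len(losses)


# Hypothetical per-sample losses for three training examples.
losses = [0.1, 0.5, 2.0]
hard_first = difficulty_weights(losses, favor_hard=True)
easy_first = difficulty_weights(losses, favor_hard=False)
```

With `favor_hard=True` the hardest sample receives the largest weight, as in hard-example emphasis; with `favor_hard=False` the ordering flips, as in easy-first (curriculum-style) schemes. Which direction helps depends on factors such as label noise and class imbalance, which is the question the abstract addresses.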


research
05/16/2022

Exploring the Learning Difficulty of Data: Theory and Measure

As learning difficulty is crucial for machine learning (e.g., difficulty...
research
03/28/2021

Understanding the role of importance weighting for deep learning

The recent paper by Byrd & Lipton (2019), based on empirical observati...
research
10/11/2021

Which Samples Should be Learned First: Easy or Hard?

An effective weighting scheme for training samples is essential for lear...
research
09/02/2023

Domain Generalization via Balancing Training Difficulty and Model Capability

Domain generalization (DG) aims to learn domain-generalizable models fro...
research
06/17/2021

Deep Learning Through the Lens of Example Difficulty

Existing work on understanding deep learning often employs measures that...
research
02/27/2013

A Stratified Simulation Scheme for Inference in Bayesian Belief Networks

Simulation schemes for probabilistic inference in Bayesian belief networ...
research
12/24/2017

Optimal Weighting for Exam Composition

A problem faced by many instructors is that of designing exams that accu...
