On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective

11/18/2020
by   Nontawat Charoenphakdee, et al.
0

The focal loss has demonstrated its effectiveness in many real-world applications such as object detection and image classification, but its theoretical understanding has been limited so far. In this paper, we first prove that the focal loss is classification-calibrated, i.e., its minimizer surely yields the Bayes-optimal classifier and thus the use of the focal loss in classification can be theoretically justified. However, we also prove a negative fact that the focal loss is not strictly proper, i.e., the confidence score of the classifier obtained by focal loss minimization does not match the true class-posterior probability and thus it is not reliable as a class-posterior probability estimator. To mitigate this problem, we next prove that a particular closed-form transformation of the confidence score allows us to recover the true class-posterior probability. Through experiments on benchmark datasets, we demonstrate that our proposed transformation significantly improves the accuracy of class-posterior probability estimation.

READ FULL TEXT

page 9

page 41

page 42

research
09/30/2019

Tutorial on Implied Posterior Probability for SVMs

Implied posterior probability of a given model (say, Support Vector Mach...
research
09/23/2022

Posterior Probabilities: Dominance and Optimism

The Bayesian posterior probability of the true state is stochastically d...
research
03/28/2021

Entropy methods for the confidence assessment of probabilistic classification models

Many classification models produce a probability distribution as the out...
research
10/28/2022

Beyond calibration: estimating the grouping loss of modern neural networks

Good decision making requires machine-learning models to provide trustwo...
research
02/07/2022

Theoretical characterization of uncertainty in high-dimensional linear classification

Being able to reliably assess not only the accuracy but also the uncerta...
research
10/23/2019

Closed-Form Full Map Posteriors for Robot Localization with Lidar Sensors

A popular class of lidar-based grid mapping algorithms computes for each...
research
01/29/2023

Learning to reject meets OOD detection: Are all abstentions created equal?

Learning to reject (L2R) and out-of-distribution (OOD) detection are two...

Please sign up or login with your details

Forgot password? Click here to reset