Robust Correction of Sampling Bias Using Cumulative Distribution Functions

10/23/2020
by   Bijan Mazaheri, et al.
0

Varying domains and biased datasets can lead to differences between the training and the target distributions, known as covariate shift. Current approaches for alleviating this often rely on estimating the ratio of training and target probability density functions. These techniques require parameter tuning and can be unstable across different datasets. We present a new method for handling covariate shift using the empirical cumulative distribution function estimates of the target distribution by a rigorous generalization of a recent idea proposed by Vapnik and Izmailov. Further, we show experimentally that our method is more robust in its predictions, is not reliant on parameter tuning and shows similar classification performance compared to the current state-of-the-art techniques on synthetic and real datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/28/2017

Kernel Robust Bias-Aware Prediction under Covariate Shift

Under covariate shift, training (source) data and testing (target) data ...
research
05/15/2023

Double-Weighting for Covariate Shift Adaptation

Supervised learning is often affected by a covariate shift in which the ...
research
09/29/2022

Dataset Complexity Assessment Based on Cumulative Maximum Scaled Area Under Laplacian Spectrum

Dataset complexity assessment aims to predict classification performance...
research
09/17/2022

Mitigating Both Covariate and Conditional Shift for Domain Generalization

Domain generalization (DG) aims to learn a model on several source domai...
research
03/29/2023

Sparse joint shift in multinomial classification

Sparse joint shift (SJS) was recently proposed as a tractable model for ...
research
12/07/2021

A Unified Framework for Multi-distribution Density Ratio Estimation

Binary density ratio estimation (DRE), the problem of estimating the rat...
research
04/18/2023

A Domain-Region Based Evaluation of ML Performance Robustness to Covariate Shift

Most machine learning methods assume that the input data distribution is...

Please sign up or login with your details

Forgot password? Click here to reset