Bayesian Distance Weighted Discrimination

10/07/2020
by   Eric F. Lock, et al.
0

Distance weighted discrimination (DWD) is a linear discrimination method that is particularly well-suited for classification tasks with high-dimensional data. The DWD coefficients minimize an intuitive objective function, which can solved very efficiently using state-of-the-art optimization techniques. However, DWD has not yet been cast into a model-based framework for statistical inference. In this article we show that DWD identifies the mode of a proper Bayesian posterior distribution, that results from a particular link function for the class probabilities and a shrinkage-inducing proper prior distribution on the coefficients. We describe a relatively efficient Markov chain Monte Carlo (MCMC) algorithm to simulate from the true posterior under this Bayesian framework. We show that the posterior is asymptotically normal and derive the mean and covariance matrix of its limiting distribution. Through several simulation studies and an application to breast cancer genomics we demonstrate how the Bayesian approach to DWD can be used to (1) compute well-calibrated posterior class probabilities, (2) assess uncertainty in the DWD coefficients and resulting sample scores, (3) improve power via semi-supervised analysis when not all class labels are available, and (4) automatically determine a penalty tuning parameter within the model-based framework. R code to perform Bayesian DWD is available at https://github.com/lockEF/BayesianDWD .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2021

Reversible Genetically Modified Mode Jumping MCMC

In this paper, we introduce a reversible version of a genetically modifi...
research
09/16/2021

Statistical Inference for Bayesian Risk Minimization via Exponentially Tilted Empirical Likelihood

The celebrated Bernstein von-Mises theorem ensures that credible regions...
research
02/26/2018

Conjugate Bayes for probit regression via unified skew-normals

Regression models for dichotomous data are ubiquitous in statistics. Bes...
research
03/27/2018

Regularization and Computation with high-dimensional spike-and-slab posterior distributions

We consider the Bayesian analysis of a high-dimensional statistical mode...
research
12/08/2020

Robust Sparse Bayesian Infinite Factor Models

Most of previous works and applications of Bayesian factor model have as...
research
03/04/2017

An unsupervised bayesian approach for the joint reconstruction and classification of cutaneous reflectance confocal microscopy images

This paper studies a new Bayesian algorithm for the joint reconstruction...
research
02/15/2019

BAREB: A Bayesian repulsive biclustering model for periodontal data

Preventing periodontal diseases (PD) and maintaining the structure and f...

Please sign up or login with your details

Forgot password? Click here to reset