Large-Scale Model Selection with Misspecification

03/17/2018
by   Emre Demirkaya, et al.
0

Model selection is crucial to high-dimensional learning and inference for contemporary big data applications in pinpointing the best set of covariates among a sequence of candidate interpretable models. Most existing work assumes implicitly that the models are correctly specified or have fixed dimensionality. Yet both features of model misspecification and high dimensionality are prevalent in practice. In this paper, we exploit the framework of model selection principles in misspecified models originated in Lv and Liu (2014) and investigate the asymptotic expansion of Bayesian principle of model selection in the setting of high-dimensional misspecified models. With a natural choice of prior probabilities that encourages interpretability and incorporates Kullback-Leibler divergence, we suggest the high-dimensional generalized Bayesian information criterion with prior probability (HGBIC_p) for large-scale model selection with misspecification. Our new information criterion characterizes the impacts of both model misspecification and high dimensionality on model selection. We further establish the consistency of covariance contrast matrix estimation and the model selection consistency of HGBIC_p in ultra-high dimensions under some mild regularity conditions. The advantages of our new method are supported by numerical studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/23/2014

Model Selection in High-Dimensional Misspecified Models

Model selection is indispensable to high-dimensional sparse modeling in ...
research
09/06/2021

Bayesian data selection

Insights into complex, high-dimensional data can be obtained by discover...
research
05/15/2019

Revisiting High Dimensional Bayesian Model Selection for Gaussian Regression

Model selection for regression problems with an increasing number of cov...
research
02/08/2019

Bayesian Model Selection with Graph Structured Sparsity

We propose a general algorithmic framework for Bayesian model selection....
research
07/08/2019

Competing Models

Different agents compete to predict a variable of interest related to a ...
research
02/05/2015

A Confident Information First Principle for Parametric Reduction and Model Selection of Boltzmann Machines

Typical dimensionality reduction (DR) methods are often data-oriented, f...
research
10/12/2018

The good, the bad, and the ugly: Bayesian model selection produces spurious posterior probabilities for phylogenetic trees

The Bayesian method is noted to produce spuriously high posterior probab...

Please sign up or login with your details

Forgot password? Click here to reset