Selection consistency of Lasso-based procedures for misspecified high-dimensional binary model and random regressors

06/10/2019
by   Mariusz Kubkowski, et al.
0

We consider selection of random predictors for high-dimensional regression problem with binary response for a general loss function. Important special case is when the binary model is semiparametric and the response function is misspecified under parametric model fit. Selection for such a scenario aims at recovering the support of the minimizer of the associated risk with large probability. We propose a two-step selection procedure which consists of screening and ordering predictors by Lasso method and then selecting a subset of predictors which minimizes Generalized Information Criterion on the corresponding nested family of models. We prove consistency of the selection method under conditions which allow for much larger number of predictors than number of observations. For the semiparametric case when distribution of random predictors satisfies linear regression conditions the true and the estimated parameters are collinear and their common support can be consistently identified.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/29/2011

Estimation And Selection Via Absolute Penalized Convex Minimization And Its Multistage Adaptive Applications

The ℓ_1-penalized method, or the Lasso, has emerged as an important tool...
research
09/12/2023

A Consistent and Scalable Algorithm for Best Subset Selection in Single Index Models

Analysis of high-dimensional data has led to increased interest in both ...
research
07/08/2020

Sparse Regression for Extreme Values

We study the problem of selecting features associated with extreme value...
research
10/08/2012

Group Model Selection Using Marginal Correlations: The Good, the Bad and the Ugly

Group model selection is the problem of determining a small subset of gr...
research
07/20/2020

Prediction in latent factor regression: Adaptive PCR and beyond

This work is devoted to the finite sample prediction risk analysis of a ...
research
09/08/2020

Conditional Uncorrelation and Efficient Non-approximate Subset Selection in Sparse Regression

Given m d-dimensional responsors and n d-dimensional predictors, sparse ...
research
05/14/2019

Fast and robust model selection based on ranks

We consider the problem of identifying important predictors in large dat...

Please sign up or login with your details

Forgot password? Click here to reset