Variable Selection in GLM and Cox Models with Second-Generation P-Values

09/20/2021
by   Yi Zuo, et al.
0

Variable selection has become a pivotal choice in data analyses that impacts subsequent inference and prediction. In linear models, variable selection using Second-Generation P-Values (SGPV) has been shown to be as good as any other algorithm available to researchers. Here we extend the idea of Penalized Regression with Second-Generation P-Values (ProSGPV) to the generalized linear model (GLM) and Cox regression settings. The proposed ProSGPV extension is largely free of tuning parameters, adaptable to various regularization schemes and null bound specifications, and is computationally fast. Like in the linear case, it excels in support recovery and parameter estimation while maintaining strong prediction performance. The algorithm also preforms as well as its competitors in the high dimensional setting (n>p). Slight modifications of the algorithm improve its performance when data are highly correlated or when signals are dense. This work significantly strengthens the case for the ProSGPV approach to variable selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2020

Variable Selection with Second-Generation P-Values

Many statistical methods have been proposed for variable selection in th...
research
02/01/2020

Higher Criticism Tuned Regression For Weak And Sparse Signals

Here we propose a novel searching scheme for a tuning parameter in high-...
research
08/16/2012

Consistent selection of tuning parameters via variable selection stability

Penalized regression models are popularly used in high-dimensional data ...
research
06/12/2022

Simple Robust Estimating Method for Generalized Linear Models and its Application to Propensity Score Estimation

A generalized linear model is one of the most well-known model families ...
research
03/19/2020

Semi-analytic approximate stability selection for correlated data in generalized linear models

We consider the variable selection problem of generalized linear models ...
research
01/21/2021

A General Framework of Online Updating Variable Selection for Generalized Linear Models with Streaming Datasets

In the research field of big data, one of important issues is how to rec...
research
12/16/2022

Multi-Task Learning for Sparsity Pattern Heterogeneity: A Discrete Optimization Approach

We extend best-subset selection to linear Multi-Task Learning (MTL), whe...

Please sign up or login with your details

Forgot password? Click here to reset