Inference in High-dimensional Linear Regression

06/22/2021
by   Heather S. Battey, et al.
0

We develop an approach to inference in a linear regression model when the number of potential explanatory variables is larger than the sample size. Our approach treats each regression coefficient in turn as the interest parameter, the remaining coefficients being nuisance parameters, and seeks an optimal interest-respecting transformation. The role of this transformation is to allow a marginal least squares analysis for each variable, as in a factorial experiment. One parameterization of the problem is found to be particularly convenient, both computationally and mathematically. In particular, it permits an analytic solution to the optimal transformation problem, facilitating comparison to other work. In contrast to regularized regression such as the lasso (Tibshirani, 1996) and its extensions, neither adjustment for selection, nor rescaling of the explanatory variables is needed, ensuring the physical interpretation of regression coefficients is retained. We discuss the use of such confidence intervals as part of a broader set of inferential statements, so as to reflect uncertainty over the model as well as over the parameters. The considerations involved in extending the work to other regression models are briefly discussed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2014

Exact Post Model Selection Inference for Marginal Screening

We develop a framework for post model selection inference, via marginal ...
research
05/18/2020

Selective Confidence Intervals for Martingale Regression Model

In this paper we consider the problem of constructing confidence interva...
research
04/30/2021

Explanation of multicollinearity using the decomposition theorem of ordinary linear regression models

In a multiple linear regression model, the algebraic formula of the deco...
research
08/01/2016

hdm: High-Dimensional Metrics

In this article the package High-dimensional Metrics (hdm) is introduced...
research
09/20/2018

Admissibility of the usual confidence set for the mean of a univariate or bivariate normal population: The unknown-variance case

In the Gaussian linear regression model (with unknown mean and variance)...
research
09/24/2019

Double-estimation-friendly inference for high-dimensional misspecified models

All models may be wrong—but that is not necessarily a problem for infere...
research
09/01/2023

Interpretation of High-Dimensional Linear Regression: Effects of Nullspace and Regularization Demonstrated on Battery Data

High-dimensional linear regression is important in many scientific field...

Please sign up or login with your details

Forgot password? Click here to reset