More Powerful Selective Kernel Tests for Feature Selection

10/14/2019
by   Jen Ning Lim, et al.
12

Refining one's hypotheses in the light of data is a commonplace scientific practice, however, this approach introduces selection bias and can lead to specious statistical analysis. One approach of addressing this phenomena is via conditioning on the selection procedure, i.e., how we have used the data to generate our hypotheses, and prevents information to be used again after selection. Many selective inference (a.k.a. post-selection inference) algorithms typically take this approach but will "over-condition" for sake of tractability. While this practice obtains well calibrated p-values, it can incur a major loss in power. In our work, we extend two recent proposals for selecting features using the Maximum Mean Discrepancy and Hilbert Schmidt Independence Criterion to condition on the minimal conditioning event. We show how recent advances in multiscale bootstrap makes conditioning on the minimal selection event possible and demonstrate our proposal over a range of synthetic and real world experiments. Our results show that our proposed test is indeed more powerful in most scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/25/2020

More Powerful and General Selective Inference for Stepwise Feature Selection using the Homotopy Continuation Approach

Conditional Selective Inference (SI) has been actively studied as a new ...
research
11/01/2017

Post-selection estimation and testing following aggregated association tests

The practice of pooling several individual test statistics to form aggre...
research
07/27/2022

Conditional Versus Unconditional Approaches to Selective Inference

We investigate a class of methods for selective inference that condition...
research
01/27/2018

More powerful post-selection inference, with application to the Lasso

Investigators often use the data to generate interesting hypotheses and ...
research
04/21/2020

Parametric Programming Approach for Powerful Lasso Selective Inference without Conditioning on Signs

In the past few years, Selective Inference (SI) has been actively studie...
research
01/13/2023

Improving Power by Conditioning on Less in Post-selection Inference for Changepoints

Post-selection inference has recently been proposed as a way of quantify...
research
11/02/2022

Inferring independent sets of Gaussian variables after thresholding correlations

We consider testing whether a set of Gaussian variables, selected from t...

Please sign up or login with your details

Forgot password? Click here to reset