selectBoost: a general algorithm to enhance the performance of variable selection methods in correlated datasets

10/03/2018
by   Ismaïl Aouadi, et al.
0

Motivation: With the growth of big data, variable selection has become one of the major challenges in statistics. Although many methods have been proposed in the literature their performance in terms of recall and precision are limited in a context where the number of variables by far exceeds the number of observations or in a high correlated setting. Results: In this article, we propose a general algorithm which improves the precision of any existing variable selection method. This algorithm is based on highly intensive simulations and takes into account the correlation structure of the data. Our algorithm can either produce a confidence index for variable selection or it can be used in an experimental design planning perspective. We demonstrate the performance of our algorithm on both simulated and real data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2018

Multivariate Spatial-Temporal Variable Selection with Applications to Seasonal Tropical Cyclone Modeling

Tropical cyclone and sea surface temperature data have been used in seve...
research
07/04/2023

Scalable variable selection for two-view learning tasks with projection operators

In this paper we propose a novel variable selection method for two-view ...
research
10/28/2019

An Ensemble Approach toward Automated Variable Selection for Network Anomaly Detection

While variable selection is essential to optimize the learning complexit...
research
03/02/2019

Sequential estimation for GEE with adaptive variables and subject selection

Modeling correlated or highly stratified multiple-response data becomes ...
research
10/29/2016

A general multiblock method for structured variable selection

Regularised canonical correlation analysis was recently extended to more...

Please sign up or login with your details

Forgot password? Click here to reset