High-dimensional Variable Screening via Conditional Martingale Difference Divergence

06/23/2022
by   Lei Fang, et al.
0

Variable screening has been a useful research area that helps to deal with ultra-high-dimensional data. When there exist both marginally and jointly dependent predictors to the response, existing methods such as conditional screening or iterative screening often suffer from the instability against the selection of the conditional set or the computational burden, respectively. In this paper, we propose a new independence measure, named conditional martingale difference divergence (CMDH), that can be treated as either a conditional or a marginal independence measure. Under regularity conditions, we show that the sure screening property of CMDH holds for both marginally and jointly active variables. Based on this measure, we propose a kernel-based model-free variable screening method that is efficient, flexible, and stable against high correlation and heterogeneity. In addition, we provide a data-driven method of conditional set selection, when the conditional set is unknown. In simulations and real data applications, we demonstrate the superior performance of the proposed method.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset