Streaming Sparse Linear Regression

11/11/2022
by Shuoguang Yang, et al.

Sparse regression is a popular approach to variable selection that also improves the prediction accuracy and interpretability of the resulting statistical model. Existing approaches focus on offline regularized regression, while the online scenario has rarely been studied. In this paper, we propose a novel online sparse linear regression framework for streaming data, where data points arrive sequentially. Our proposed method is memory efficient and requires less stringent restricted strong convexity assumptions. Theoretically, we show that with a properly chosen regularization parameter, the ℓ_2-norm statistical error of our estimator decays to zero at the optimal rate of Õ(√(s/t)), where s is the sparsity level, t is the streaming sample size, and Õ(·) hides logarithmic terms. Numerical experiments demonstrate the practical efficiency of our algorithm.
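To make the streaming setting concrete, here is a minimal sketch of one way an ℓ_1-regularized linear regression could be updated as points arrive. It is not the algorithm proposed in the paper, whose details the abstract does not give. The class name StreamingLasso, the constant c, the inner_steps parameter, and the schedule λ_t = c·√(log p / t) are all assumptions made for illustration. The sketch keeps only running averages of x xᵀ and x·y, so memory does not grow with t, and takes a few proximal-gradient (soft-thresholding) steps after each new point.

```python
import math
import random


def soft_threshold(v, tau):
    """Proximal map of tau * |v|: shrink v toward zero by tau."""
    return math.copysign(max(abs(v) - tau, 0.0), v)


class StreamingLasso:
    """Hypothetical streaming lasso (illustrative, not the paper's algorithm)."""

    def __init__(self, p, c=0.5, inner_steps=5):
        self.p = p                      # number of features
        self.c = c                      # assumed constant in lambda_t
        self.inner_steps = inner_steps  # proximal-gradient steps per sample
        self.t = 0                      # samples seen so far
        self.gram = [[0.0] * p for _ in range(p)]  # running mean of x x^T
        self.xy = [0.0] * p                        # running mean of x * y
        self.beta = [0.0] * p                      # current estimate

    def update(self, x, y):
        """Consume one (x, y) pair and return the refreshed estimate."""
        p = self.p
        self.t += 1
        w = 1.0 / self.t
        # Update running averages in place; the raw sample is then discarded.
        for i in range(p):
            self.xy[i] += w * (x[i] * y - self.xy[i])
            row = self.gram[i]
            for j in range(p):
                row[j] += w * (x[i] * x[j] - row[j])
        # Assumed regularization schedule: lambda_t = c * sqrt(log p / t).
        lam = self.c * math.sqrt(math.log(max(p, 2)) / self.t)
        # trace(gram) upper-bounds its largest eigenvalue, so this step is safe.
        trace = sum(self.gram[i][i] for i in range(p))
        step = 1.0 / max(trace, 1e-12)
        # Warm-started proximal-gradient steps on
        # 0.5 * b^T G b - xy^T b + lam * ||b||_1.
        for _ in range(self.inner_steps):
            grad = [
                sum(self.gram[i][j] * self.beta[j] for j in range(p)) - self.xy[i]
                for i in range(p)
            ]
            self.beta = [
                soft_threshold(self.beta[i] - step * grad[i], step * lam)
                for i in range(p)
            ]
        return self.beta


if __name__ == "__main__":
    random.seed(0)
    p = 8
    beta_true = [2.0, -1.5] + [0.0] * (p - 2)  # sparsity level s = 2
    model = StreamingLasso(p)
    for _ in range(1500):
        x = [random.gauss(0.0, 1.0) for _ in range(p)]
        y = sum(a * b for a, b in zip(x, beta_true)) + random.gauss(0.0, 0.1)
        model.update(x, y)
    print([round(b, 3) for b in model.beta])
```

Storing the p × p running average costs O(p²) memory regardless of how many points have streamed past. This is one simple route to a memory-bounded estimator, and the paper's method may achieve its memory efficiency differently.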
