Universality laws for Gaussian mixtures in generalized linear models

02/17/2023
βˆ™
by   Yatin Dandi, et al.
βˆ™
1
βˆ™

Let (x_i, y_i)_i=1,…,n denote independent samples from a general mixture distribution βˆ‘_cβˆˆπ’žΟ_cP_c^x, and consider the hypothesis class of generalized linear models Ε· = F(Θ^⊀x). In this work, we investigate the asymptotic joint statistics of the family of generalized linear estimators (Θ_1, …, Θ_M) obtained either from (a) minimizing an empirical risk RΜ‚_n(Θ;X,y) or (b) sampling from the associated Gibbs measure exp(-Ξ² n RΜ‚_n(Θ;X,y)). Our main contribution is to characterize under which conditions the asymptotic joint statistics of this family depends (on a weak sense) only on the means and covariances of the class conditional features distribution P_c^x. In particular, this allow us to prove the universality of different quantities of interest, such as the training and generalization errors, redeeming a recent line of work in high-dimensional statistics working under the Gaussian mixture hypothesis. Finally, we discuss the applications of our results to different machine learning tasks of interest, such as ensembling and uncertainty

READ FULL TEXT
research
βˆ™ 06/07/2021

Learning Gaussian Mixtures with Generalised Linear Models: Precise Asymptotics in High-dimensions

Generalised linear models for multi-class classification problems are on...
research
βˆ™ 02/17/2023

Are Gaussian data all you need? Extents and limits of universality in high-dimensional generalized linear estimation

In this manuscript we consider the problem of generalized linear estimat...
research
βˆ™ 11/05/2018

Mixture of generalized linear models: identifiability and applications

We consider finite mixtures of generalized linear models with binary out...
research
βˆ™ 07/14/2015

Robust Estimation of Self-Exciting Generalized Linear Models with Application to Neuronal Modeling

We consider the problem of estimating self-exciting generalized linear m...
research
βˆ™ 05/26/2022

Gaussian Universality of Linear Classifiers with Random Labels in High-Dimension

While classical in many theoretical settings, the assumption of Gaussian...
research
βˆ™ 03/24/2018

Asymptotic Representations of Statistics in the Functional Empirical process : A portal and some applications

In this research monograph, we deal with a very general asymptotic repre...
research
βˆ™ 11/16/2009

Kullback-Leibler aggregation and misspecified generalized linear models

In a regression setup with deterministic design, we study the pure aggre...

Please sign up or login with your details

Forgot password? Click here to reset