Poisson PCA for matrix count data

10/27/2021
by Joni Virta, et al.

We develop a dimension reduction framework for data consisting of matrices of counts. Our model assumes the existence of a small number of independent normal latent variables that drive the dependency structure of the observed data, and it can be seen as the exact discrete analogue of a contaminated low-rank matrix normal model. We derive estimators for the model parameters and establish their root-n consistency. An extension of a recent proposal from the literature is used to estimate the latent dimension of the model. Additionally, a sparsity-accommodating variant of the model is considered. In analysing both simulated data and real abundance data, the method is shown to surpass its vectorization-based competitors as well as matrix methods that assume a continuous data distribution.
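To make the model description more concrete, the following is a minimal simulation sketch and not the authors' implementation. It assumes, beyond what the abstract states, that each count is conditionally Poisson with a log link, that the low-rank signal has the bilinear form U Z V^T with orthonormal loading matrices, and that the "contamination" is additive Gaussian noise on the log scale. The names `simulate_matrix_counts`, `U`, `V`, `Z`, `d1`, `d2` and `noise_sd` are hypothetical and chosen only for illustration.

```python
import numpy as np


def simulate_matrix_counts(n, p, q, d1, d2, noise_sd=0.1, seed=0):
    """Draw n count matrices of size p x q from an assumed
    Poisson log-normal model with a low-rank latent signal."""
    rng = np.random.default_rng(seed)

    # Assumed orthonormal loading matrices for rows (p x d1) and columns (q x d2).
    U, _ = np.linalg.qr(rng.standard_normal((p, d1)))
    V, _ = np.linalg.qr(rng.standard_normal((q, d2)))

    # Independent standard normal latent variables, one d1 x d2 block per observation.
    Z = rng.standard_normal((n, d1, d2))

    # Low-rank signal U Z_i V^T for every observation i (matmul broadcasts over n).
    signal = U @ Z @ V.T

    # Assumed contamination: independent Gaussian noise on the log-rate scale.
    noise = noise_sd * rng.standard_normal((n, p, q))

    # Conditionally on the latent log-rates, entries are independent Poisson counts.
    X = rng.poisson(np.exp(signal + noise))
    return X, U, V


if __name__ == "__main__":
    X, U, V = simulate_matrix_counts(n=200, p=6, q=5, d1=2, d2=2)
    print(X.shape)
    print(X[0])
```

In this sketch the dependence between entries of each count matrix comes entirely from the shared latent block `Z`, which is the role the abstract assigns to the latent normal variables. The noise level and dimensions are arbitrary choices for the example.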


