Selective Inference for Latent Block Models

05/27/2020
by   Chihiro Watanabe, et al.
21

Model selection in latent block models has been a challenging but important task in the field of statistics. Specifically, a major challenge is encountered when constructing a test on a block structure obtained by applying a specific clustering algorithm to a finite size matrix. In this case, it becomes crucial to consider the selective bias in the block structure, that is, the block structure is selected from all the possible cluster memberships based on some criterion by the clustering algorithm. To cope with this problem, this study provides a selective inference method for latent block models. Specifically, we construct a statistical test on a set of row and column cluster memberships of a latent block model, which is given by a squared residue minimization algorithm. The proposed test, by its nature, includes and thus can also be used as the test on the set of row and column cluster numbers. We also propose an approximated version of the test based on simulated annealing to avoid combinatorial explosion in searching the optimal block structure. The results show that the proposed exact and approximated tests work effectively, compared to the naive test that did not take the selective bias into account.

READ FULL TEXT

page 13

page 17

page 20

research
06/10/2019

Goodness-of-fit Test for Latent Block Models

Latent Block Models are used for probabilistic biclustering, which is sh...
research
01/30/2023

Selective inference for clustering with unknown variance

In many modern statistical problems, the limited available data must be ...
research
09/13/2021

ℋ-inverses for RBF interpolation

We consider the interpolation problem for a class of radial basis functi...
research
02/22/2014

Scaling Nonparametric Bayesian Inference via Subsample-Annealing

We describe an adaptation of the simulated annealing algorithm to nonpar...
research
06/16/2022

Variational Estimators of the Degree-corrected Latent Block Model for Bipartite Networks

Biclustering on bipartite graphs is an unsupervised learning task that s...
research
08/03/2020

Conditional Latent Block Model: a Multivariate Time Series Clustering Approach for Autonomous Driving Validation

Autonomous driving systems validation remains one of the biggest challen...
research
07/26/2018

Selective Clustering Annotated using Modes of Projections

Selective clustering annotated using modes of projections (SCAMP) is a n...

Please sign up or login with your details

Forgot password? Click here to reset