Learning Sparsity and Block Diagonal Structure in Multi-View Mixture Models

12/30/2020
by   Iain Carmichael, et al.
0

Scientific studies increasingly collect multiple modalities of data to investigate a phenomenon from several perspectives. In integrative data analysis it is important to understand how information is heterogeneously spread across these different data sources. To this end, we consider a parametric clustering model for the subjects in a multi-view data set (i.e. multiple sources of data from the same set of subjects) where each view marginally follows a mixture model. In the case of two views, the dependence between them is captured by a cluster membership matrix parameter and we aim to learn the structure of this matrix (e.g. the zero pattern). First, we develop a penalized likelihood approach to estimate the sparsity pattern of the cluster membership matrix. For the specific case of block diagonal structures, we develop a constrained likelihood formulation where this matrix is constrained to be block diagonal up to permutations of the rows and columns. To enforce block diagonal constraints we propose a novel optimization approach based on the symmetric graph Laplacian. We demonstrate the performance of these methods through both simulations and applications to data sets from cancer genetics and neuroscience. Both methods naturally extend to multiple views.

READ FULL TEXT
research
08/25/2016

Multi-View Fuzzy Clustering with Minimax Optimization for Effective Clustering of Data from Multiple Sources

Multi-view data clustering refers to categorizing a data set by making g...
research
08/25/2016

Incremental Minimax Optimization based Fuzzy Clustering for Large Multi-view Data

Incremental clustering approaches have been proposed for handling large ...
research
05/23/2018

Subspace Clustering by Block Diagonal Representation

This paper studies the subspace clustering problem. Given some data poin...
research
11/16/2021

Sparse Graph Learning Under Laplacian-Related Constraints

We consider the problem of learning a sparse undirected graph underlying...
research
12/11/2019

Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data

In mixed multi-view data, multiple sets of diverse features are measured...
research
03/17/2020

Directionally Dependent Multi-View Clustering Using Copula Model

In recent biomedical scientific problems, it is a fundamental issue to i...
research
11/23/2019

Learning a Representation with the Block-Diagonal Structure for Pattern Classification

Sparse-representation-based classification (SRC) has been widely studied...

Please sign up or login with your details

Forgot password? Click here to reset