Learning Mixtures of Discrete Product Distributions using Spectral Decompositions

11/12/2013
by   Prateek Jain, et al.
0

We study the problem of learning a distribution from samples, when the underlying distribution is a mixture of product distributions over discrete domains. This problem is motivated by several practical applications such as crowd-sourcing, recommendation systems, and learning Boolean functions. The existing solutions either heavily rely on the fact that the number of components in the mixtures is finite or have sample/time complexity that is exponential in the number of components. In this paper, we introduce a polynomial time/sample complexity method for learning a mixture of r discrete product distributions over {1, 2, ..., ℓ}^n, for general ℓ and r. We show that our approach is statistically consistent and further provide finite sample guarantees. We use techniques from the recent work on tensor decompositions for higher-order moment matching. A crucial step in these moment matching methods is to construct a certain matrix and a certain tensor with low-rank spectral decompositions. These tensors are typically estimated directly from the samples. The main challenge in learning mixtures of discrete product distributions is that these low-rank tensors cannot be obtained directly from the sample moments. Instead, we reduce the tensor estimation problem to: a) estimating a low-rank matrix using only off-diagonal block elements; and b) estimating a tensor using a small number of linear measurements. Leveraging on recent developments in matrix completion, we give an alternating minimization based method to estimate the low-rank matrix, and formulate the tensor completion problem as a least-squares problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2015

Symmetric Tensor Completion from Multilinear Entries and Learning Product Mixtures over the Hypercube

We give an algorithm for completing an order-m symmetric low-rank tensor...
research
05/15/2015

Optimal Low-Rank Tensor Recovery from Separable Measurements: Four Contractions Suffice

Tensors play a central role in many modern machine learning and signal p...
research
03/31/2018

Learning tensors from partial binary measurements

In this paper we generalize the 1-bit matrix completion problem to highe...
research
12/23/2016

Spectral algorithms for tensor completion

In the tensor completion problem, one seeks to estimate a low-rank tenso...
research
11/22/2019

Visual Relationship Detection with Low Rank Non-Negative Tensor Decomposition

We address the problem of Visual Relationship Detection (VRD) which aims...
research
07/07/2015

Rethinking LDA: moment matching for discrete ICA

We consider moment matching techniques for estimation in Latent Dirichle...
research
10/09/2020

Concurrent Alternating Least Squares for multiple simultaneous Canonical Polyadic Decompositions

Tensor decompositions, such as CANDECOMP/PARAFAC (CP), are widely used i...

Please sign up or login with your details

Forgot password? Click here to reset