Streaming Inference for Infinite Non-Stationary Clustering

by   Rylan Schaeffer, et al.

Learning from a continuous stream of non-stationary data in an unsupervised manner is arguably one of the most common and most challenging settings facing intelligent agents. Here, we attack learning under all three conditions (unsupervised, streaming, non-stationary) in the context of clustering, also known as mixture modeling. We introduce a novel clustering algorithm that endows mixture models with the ability to create new clusters online, as demanded by the data, in a probabilistic, time-varying, and principled manner. To achieve this, we first define a novel stochastic process called the Dynamical Chinese Restaurant Process (Dynamical CRP), which is a non-exchangeable distribution over partitions of a set; next, we show that the Dynamical CRP provides a non-stationary prior over cluster assignments and yields an efficient streaming variational inference algorithm. We conclude with experiments showing that the Dynamical CRP can be applied on diverse synthetic and real data with Gaussian and non-Gaussian likelihoods.


page 1

page 2

page 3

page 4


Dirichlet process mixture models for non-stationary data streams

In recent years, we have seen a handful of work on inference algorithms ...

Harmonizable mixture kernels with variational Fourier features

The expressive power of Gaussian processes depends heavily on the choice...

Learning Manifolds from Non-stationary Streaming Data

Streaming adaptations of manifold learning based dimensionality reductio...

Online Clustering by Penalized Weighted GMM

With the dawn of the Big Data era, data sets are growing rapidly. Data i...

Adaptive regularization for Lasso models in the context of non-stationary data streams

Large scale, streaming datasets are ubiquitous in modern machine learnin...

Modelling time evolving interactions in networks through a non stationary extension of stochastic block models

In this paper, we focus on the stochastic block model (SBM),a probabilis...

Towards the interpretation of time-varying regularization parameters in streaming penalized regression models

High-dimensional, streaming datasets are ubiquitous in modern applicatio...

Please sign up or login with your details

Forgot password? Click here to reset