Optimal Clustering under Uncertainty

06/02/2018
by   Lori A. Dalton, et al.
0

Classical clustering algorithms typically either lack an underlying probability framework to make them predictive or focus on parameter estimation rather than defining and minimizing a notion of error. Recent work addresses these issues by developing a probabilistic framework based on the theory of random labeled point processes and characterizing a Bayes clusterer that minimizes the number of misclustered points. The Bayes clusterer is analogous to the Bayes classifier. Whereas determining a Bayes classifier requires full knowledge of the feature-label distribution, deriving a Bayes clusterer requires full knowledge of the point process. When uncertain of the point process, one would like to find a robust clusterer that is optimal over the uncertainty, just as one may find optimal robust classifiers with uncertain feature-label distributions. Herein, we derive an optimal robust clusterer by first finding an effective random point process that incorporates all randomness within its own probabilistic structure and from which a Bayes clusterer can be derived that provides an optimal robust clusterer relative to the uncertainty. This is analogous to the use of effective class-conditional distributions in robust classification. After evaluating the performance of robust clusterers in synthetic mixtures of Gaussians models, we apply the framework to granular imaging, where we make use of the asymptotic granulometric moment theory for granular images to relate robust clustering theory to the application.

READ FULL TEXT
research
05/15/2011

Bounds on the Bayes Error Given Moments

We show how to compute lower bounds for the supremum Bayes error if the ...
research
08/30/2022

Empirical and Full Bayes estimation of the type of a Pitman-Yor process

The Pitman-Yor process is a random discrete probability distribution of ...
research
06/09/2020

A generalized Bayes framework for probabilistic clustering

Loss-based clustering methods, such as k-means and its variants, are sta...
research
02/20/2020

A Bayes-Optimal View on Adversarial Examples

The ability to fool modern CNN classifiers with tiny perturbations of th...
research
06/17/2020

Universal Lower-Bounds on Classification Error under Adversarial Attacks and Random Corruption

We theoretically analyse the limits of robustness to test-time adversari...
research
03/27/2013

Practical Issues in Constructing a Bayes' Belief Network

Bayes belief networks and influence diagrams are tools for constructing ...
research
01/29/2021

Optimal strategies for reject option classifiers

In classification with a reject option, the classifier is allowed in unc...

Please sign up or login with your details

Forgot password? Click here to reset