A Greedy and Optimistic Approach to Clustering with a Specified Uncertainty of Covariates

04/18/2022
by   Akifumi Okuno, et al.
0

In this study, we examine a clustering problem in which the covariates of each individual element in a dataset are associated with an uncertainty specific to that element. More specifically, we consider a clustering approach in which a pre-processing applying a non-linear transformation to the covariates is used to capture the hidden data structure. To this end, we approximate the sets representing the propagated uncertainty for the pre-processed features empirically. To exploit the empirical uncertainty sets, we propose a greedy and optimistic clustering (GOC) algorithm that finds better feature candidates over such sets, yielding more condensed clusters. As an important application, we apply the GOC algorithm to synthetic datasets of the orbital properties of stars generated through our numerical simulation mimicking the formation process of the Milky Way. The GOC algorithm demonstrates an improved performance in finding sibling stars originating from the same dwarf galaxy. These realistic datasets have also been made publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/16/2021

K-expectiles clustering

K-means clustering is one of the most widely-used partitioning algorithm...
research
11/15/2017

Parsimonious Model-Based Clustering with Covariates

In model-based clustering methods using finite mixture models, the clust...
research
04/12/2021

A smoothed and probabilistic PARAFAC model with covariates

Analysis and clustering of multivariate time-series data attract growing...
research
05/22/2014

Semi-supervised Spectral Clustering for Classification

We propose a Classification Via Clustering (CVC) algorithm which enables...
research
10/27/2016

PCM and APCM Revisited: An Uncertainty Perspective

In this paper, we take a new look at the possibilistic c-means (PCM) and...
research
07/10/2016

Convex Relaxation for Community Detection with Covariates

Community detection in networks is an important problem in many applied ...

Please sign up or login with your details

Forgot password? Click here to reset