A Centroid Auto-Fused Hierarchical Fuzzy c-Means Clustering

04/27/2020
by   Yunxia Lin, et al.
20

Like k-means and Gaussian Mixture Model (GMM), fuzzy c-means (FCM) with soft partition has also become a popular clustering algorithm and still is extensively studied. However, these algorithms and their variants still suffer from some difficulties such as determination of the optimal number of clusters which is a key factor for clustering quality. A common approach for overcoming this difficulty is to use the trial-and-validation strategy, i.e., traversing every integer from large number like √(n) to 2 until finding the optimal number corresponding to the peak value of some cluster validity index. But it is scarcely possible to naturally construct an adaptively agglomerative hierarchical cluster structure as using the trial-and-validation strategy. Even possible, existing different validity indices also lead to different number of clusters. To effectively mitigate the problems while motivated by convex clustering, in this paper we present a Centroid Auto-Fused Hierarchical Fuzzy c-means method (CAF-HFCM) whose optimization procedure can automatically agglomerate to form a cluster hierarchy, more importantly, yielding an optimal number of clusters without resorting to any validity index. Although a recently-proposed robust-learning fuzzy c-means (RL-FCM) can also automatically obtain the best number of clusters without the help of any validity index, so-involved 3 hyper-parameters need to adjust expensively, conversely, our CAF-HFCM involves just 1 hyper-parameter which makes the corresponding adjustment is relatively easier and more operational. Further, as an additional benefit from our optimization objective, the CAF-HFCM effectively reduces the sensitivity to the initialization of clustering performance. Moreover, our proposed CAF-HFCM method is able to be straightforwardly extended to various variants of FCM.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

page 8

page 10

page 11

research
05/19/2020

A New Validity Index for Fuzzy-Possibilistic C-Means Clustering

In some complicated datasets, due to the presence of noisy data points a...
research
06/05/2018

A Visual Quality Index for Fuzzy C-Means

Cluster analysis is widely used in the areas of machine learning and dat...
research
08/01/2018

MaxMin Linear Initialization for Fuzzy C-Means

Clustering is an extensive research area in data science. The aim of clu...
research
06/17/2022

ck-means, a novel unsupervised learning method that combines fuzzy and crispy clustering methods to extract intersecting data

Clustering data is a popular feature in the field of unsupervised machin...
research
11/03/2016

A-Ward_pe̱ṯa̱: Effective hierarchical clustering using the Minkowski metric and a fast k -means initialisation

In this paper we make two novel contributions to hierarchical clustering...
research
11/27/2019

K-MACE and Kernel K-MACE Clustering

Determining the correct number of clusters (CNC) is an important task in...
research
08/28/2023

A correlation-based fuzzy cluster validity index with secondary options detector

The optimal number of clusters is one of the main concerns when applying...

Please sign up or login with your details

Forgot password? Click here to reset