Refining a k-nearest neighbor graph for a computationally efficient spectral clustering

02/22/2023
by   Mashaan Alshammari, et al.
0

Spectral clustering became a popular choice for data clustering for its ability of uncovering clusters of different shapes. However, it is not always preferable over other clustering methods due to its computational demands. One of the effective ways to bypass these computational demands is to perform spectral clustering on a subset of points (data representatives) then generalize the clustering outcome, this is known as approximate spectral clustering (ASC). ASC uses sampling or quantization to select data representatives. This makes it vulnerable to 1) performance inconsistency (since these methods have a random step either in initialization or training), 2) local statistics loss (because the pairwise similarities are extracted from data representatives instead of data points). We proposed a refined version of k-nearest neighbor graph, in which we keep data points and aggressively reduce number of edges for computational efficiency. Local statistics were exploited to keep the edges that do not violate the intra-cluster distances and nullify all other edges in the k-nearest neighbor graph. We also introduced an optional step to automatically select the number of clusters C. The proposed method was tested on synthetic and real datasets. Compared to ASC methods, the proposed method delivered a consistent performance despite significant reduction of edges.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2013

Spectral Clustering Based on Local PCA

We propose a spectral clustering method based on local principal compone...
research
02/25/2023

A parameter-free graph reduction for spectral clustering and SpectralNet

Graph-based clustering methods like spectral clustering and SpectralNet ...
research
02/22/2023

Approximate spectral clustering density-based similarity for noisy datasets

Approximate spectral clustering (ASC) was developed to overcome heavy co...
research
04/20/2015

Nonparametric Nearest Neighbor Random Process Clustering

We consider the problem of clustering noisy finite-length observations o...
research
02/08/2022

Systematically improving existing k-means initialization algorithms at nearly no cost, by pairwise-nearest-neighbor smoothing

We present a meta-method for initializing (seeding) the k-means clusteri...
research
05/06/2015

Geometry-Aware Neighborhood Search for Learning Local Models for Image Reconstruction

Local learning of sparse image models has proven to be very effective to...
research
04/28/2021

SMLSOM: The shrinking maximum likelihood self-organizing map

Determining the number of clusters in a dataset is a fundamental issue i...

Please sign up or login with your details

Forgot password? Click here to reset