Worst-Case and Smoothed Analysis of Hartigan's Method for k-Means Clustering

09/19/2023
by   Bodo Manthey, et al.
0

We analyze the running time of Hartigan's method, an old algorithm for the k-means clustering problem. First, we construct an instance on the line on which the method can take 2^Ω(n) steps to converge, demonstrating that Hartigan's method has exponential worst-case running time even when k-means is easy to solve. As this is in contrast to the empirical performance of the algorithm, we also analyze the running time in the framework of smoothed analysis. In particular, given an instance of n points in d dimensions, we prove that the expected number of iterations needed for Hartigan's method to terminate is bounded by k^12kd·(n, k, d, 1/σ) when the points in the instance are perturbed by independent d-dimensional Gaussian random variables of mean 0 and standard deviation σ.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2018

Constant Factor Time Optimal Multi-Robot Routing on High-Dimensional Grids in Mostly Sub-Quadratic Time

Let G = (V, E) be an m_1 ×...× m_k grid. Assuming that each v ∈ V is occ...
research
08/01/2023

Smoothed Analysis of the 2-Opt Heuristic for the TSP under Gaussian Noise

The 2-opt heuristic is a very simple local search heuristic for the trav...
research
02/14/2023

Worst Case and Probabilistic Analysis of the 2-Opt Algorithm for the TSP

2-Opt is probably the most basic local search heuristic for the TSP. Thi...
research
07/30/2019

Parallelization of Kmeans++ using CUDA

K-means++ is an algorithm which is invented to improve the process of fi...
research
04/13/2022

Population Diversity Leads to Short Running Times of Lexicase Selection

In this paper we investigate why the running time of lexicase parent sel...
research
12/19/2017

Linear Time Clustering for High Dimensional Mixtures of Gaussian Clouds

Clustering mixtures of Gaussian distributions is a fundamental and chall...
research
04/29/2017

A Partitioning Algorithm for Detecting Eventuality Coincidence in Temporal Double recurrence

A logical theory of regular double or multiple recurrence of eventualiti...

Please sign up or login with your details

Forgot password? Click here to reset