A Nearly Optimal Size Coreset Algorithm with Nearly Linear Time

10/15/2022
by   Yichuan Deng, et al.
0

A coreset is a point set containing information about geometric properties of a larger point set. A series of previous works show that in many machine learning problems, especially in clustering problems, coreset could be very useful to build efficient algorithms. Two main measures of an coreset construction algorithm's performance are the running time of the algorithm and the size of the coreset output by the algorithm. In this paper we study the construction of coresets for the (k,z)-clustering problem, which is a generalization of k-means and k-median problem. By properly designing a sketching-based distance estimation data structure, we propose faster algorithms that construct coresets with matching size of the state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/01/2018

Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to k-Clustering

Sensitivity based sampling is crucial for constructing nearly-optimal co...
research
12/20/2018

Near-Linear Time Approximation Schemes for Clustering in Doubling Metrics

We consider the classic Facility Location, k-Median, and k-Means problem...
research
12/27/2019

Strong Coresets for Subspace Approximation and k-Median in Nearly Linear Time

Recently the first (1+ϵ)-approximate strong coresets for k-median and su...
research
06/16/2023

Nearly-Optimal Hierarchical Clustering for Well-Clustered Graphs

This paper presents two efficient hierarchical clustering (HC) algorithm...
research
06/05/2023

Near-Optimal Quantum Coreset Construction Algorithms for Clustering

k-Clustering in ℝ^d (e.g., k-median and k-means) is a fundamental machin...
research
06/07/2019

Robust subgaussian estimation of a mean vector in nearly linear time

We construct an algorithm, running in nearly-linear time, which is robus...
research
04/28/2023

Faster Submodular Maximization for Several Classes of Matroids

The maximization of submodular functions have found widespread applicati...

Please sign up or login with your details

Forgot password? Click here to reset