Sublinear Time Algorithms for Several Geometric Optimization (With Outliers) Problems In Machine Learning

01/07/2023
by   Hu Ding, et al.
0

In this paper, we study several important geometric optimization problems arising in machine learning. First, we revisit the Minimum Enclosing Ball (MEB) problem in Euclidean space ℝ^d. The problem has been extensively studied before, but real-world machine learning tasks often need to handle large-scale datasets so that we cannot even afford linear time algorithms. Motivated by the recent studies on beyond worst-case analysis, we introduce the notion of stability for MEB, which is natural and easy to understand. Roughly speaking, an instance of MEB is stable, if the radius of the resulting ball cannot be significantly reduced by removing a small fraction of the input points. Under the stability assumption, we present two sampling algorithms for computing radius-approximate MEB with sample complexities independent of the number of input points n. In particular, the second algorithm has the sample complexity even independent of the dimensionality d. We also consider the general case without the stability assumption. We present a hybrid algorithm that can output either a radius-approximate MEB or a covering-approximate MEB. Our algorithm improves the running time and the number of passes for the previous sublinear MEB algorithms. Our method relies on two novel techniques, the Uniform-Adaptive Sampling method and Sandwich Lemma. Furthermore, we observe that these two techniques can be generalized to design sublinear time algorithms for a broader range of geometric optimization problems with outliers in high dimensions, including MEB with outliers, one-class and two-class linear SVMs with outliers, k-center clustering with outliers, and flat fitting with outliers. Our proposed algorithms also work fine for kernels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2019

Minimum Enclosing Ball Revisited: Stability, Sub-linear Time Algorithms, and Extension

In this paper, we revisit the Minimum Enclosing Ball (MEB) problem and i...
research
04/20/2020

A Sub-linear Time Framework for Geometric Optimization with Outliers in High Dimensions

Many real-world problems can be formulated as geometric optimization pro...
research
04/08/2019

Minimum Enclosing Ball Revisited: Stability and Sub-linear Time Algorithms

In this paper, we revisit the Minimum Enclosing Ball (MEB) problem and i...
research
03/12/2023

Sparse Higher Order Čech Filtrations

For a finite set of balls of radius r, the k-fold cover is the space cov...
research
02/27/2020

Layered Sampling for Robust Optimization Problems

In real world, our datasets often contain outliers. Moreover, the outlie...
research
12/03/2019

Fast Algorithms for Geometric Consensuses

Let P be a set of n points in ^d in general position. A median hyperplan...
research
04/25/2018

Solving Minimum Enclosing Ball with Outliers: Algorithm, Implementation, and Application

Motivated by the arising realistic issues in big data, the problem of Mi...

Please sign up or login with your details

Forgot password? Click here to reset