Streaming Facility Location in High Dimension via New Geometric Hashing

by Artur Czumaj, et al.

In Euclidean Uniform Facility Location, the input is a set of clients in ℝ^d and the goal is to place facilities to serve them, so as to minimize the total cost of opening facilities plus connecting the clients. We study the classical setting of dynamic geometric streams, where the clients are presented as a sequence of insertions and deletions of points in the grid {1,…,Δ}^d, and we focus on the high-dimensional regime, where the algorithm's space complexity must be polynomial (and certainly not exponential) in d·logΔ. We present a new algorithmic framework, based on importance sampling from the stream, for an O(1)-approximation of the optimal cost using only poly(d·logΔ) space. This framework is easy to implement in two passes, one for sampling points and the other for estimating their contribution. Over random-order streams, we can extend this to a one-pass algorithm by using the two halves of the stream separately. Our main result, for arbitrary-order streams, computes an O(d^1.5)-approximation in one pass by using the new framework but combining the two passes differently. This improves upon previous algorithms that either need space exponential in d or only guarantee an O(d·log^2Δ)-approximation, and therefore our algorithms for high-dimensional streams are the first to avoid the O(logΔ)-factor in approximation that is inherent to the widely-used quadtree decomposition. Our improvement is achieved by introducing a novel geometric hashing scheme that maps points in ℝ^d into buckets of bounded diameter, with the key property that every point set of small-enough diameter is hashed into at most poly(d) distinct buckets. Finally, we complement our results by showing that a 1.085-approximation requires space exponential in poly(d·logΔ), even for insertion-only streams.
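To make the objective concrete, the following is a minimal offline sketch of the Uniform Facility Location cost described above: a uniform opening cost per facility plus the Euclidean distance from each client to its nearest open facility. It is not the streaming algorithm from the paper; the function name `facility_location_cost` and the parameter `opening_cost` are illustrative assumptions, and the abstract does not say whether facilities must lie on the grid, so the sketch accepts arbitrary points.

```python
import math

def facility_location_cost(clients, facilities, opening_cost):
    """Total cost of a candidate solution (illustrative helper, not from the paper).

    clients, facilities: sequences of equal-length coordinate tuples.
    opening_cost: the uniform cost paid for each opened facility.
    """
    if not facilities:
        raise ValueError("at least one facility must be opened")
    # Each client connects to its nearest open facility (Euclidean distance).
    connection = sum(
        min(math.dist(c, f) for f in facilities) for c in clients
    )
    return opening_cost * len(facilities) + connection

# Two nearby clients and one far-away client in the plane.
clients = [(1, 1), (1, 2), (10, 10)]
one_facility = facility_location_cost(clients, [(1, 1)], opening_cost=3)
two_facilities = facility_location_cost(clients, [(1, 1), (10, 10)], opening_cost=3)
print(one_facility, two_facilities)
```

In this toy instance, opening a second facility at the far-away client pays an extra opening cost but removes the long connection, so the two-facility solution is cheaper. This is the trade-off the objective balances.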



Streaming Euclidean Max-Cut: Dimension vs Data Reduction

Max-Cut is a fundamental problem that has been studied extensively in va...

Streaming Algorithms for Geometric Steiner Forest

We consider a natural generalization of the Steiner tree problem, the St...

Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to k-Clustering

Sensitivity based sampling is crucial for constructing nearly-optimal co...

Weighted Maximum Independent Set of Geometric Objects in Turnstile Streams

We study the Maximum Independent Set problem for geometric objects given...

Streaming Euclidean MST to a Constant Factor

We study streaming algorithms for the fundamental geometric problem of c...

High-Dimensional Geometric Streaming in Polynomial Space

Many existing algorithms for streaming geometric data analysis have been...

Improved Approximate Rips Filtrations with Shifted Integer Lattices and Cubical Complexes

Rips complexes are important structures for analyzing topological featur...
