Sparse Dimensionality Reduction Revisited

The sparse Johnson-Lindenstrauss transform is one of the central techniques in dimensionality reduction. It supports embedding a set of n points in ℝ^d into m = O(ε^{-2} log n) dimensions while preserving all pairwise distances to within 1 ± ε. Each input point x is mapped to Ax, where A is an m × d matrix with s non-zeros per column, allowing for an embedding time of O(s ‖x‖_0). Since the sparsity of A governs the embedding time, much work has gone into improving the sparsity s. The current state of the art, by Kane and Nelson (JACM'14), shows that s = O(ε^{-1} log n) suffices. This is almost matched by a lower bound of s = Ω(ε^{-1} log n / log(1/ε)) by Nelson and Nguyen (STOC'13). Previous work thus suggests that we have near-optimal embeddings. In this work, we revisit sparse embeddings and identify a loophole in the lower bound. Concretely, it requires d ≥ n, which is unrealistic in many applications. We exploit this loophole to give a sparser embedding when d = o(n), achieving s = O(ε^{-1}(log n / log(1/ε) + log^{2/3} n · log^{1/3} d)). We also complement our analysis by strengthening the lower bound of Nelson and Nguyen to hold also when d ≪ n, thereby matching the first term in our new sparsity upper bound. Finally, we also improve the sparsity of the best oblivious subspace embeddings for optimal embedding dimensionality.
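
To make the O(s ‖x‖_0) embedding time concrete, here is a minimal sketch of a generic sparse embedding matrix with exactly s non-zeros per column, each equal to ±1/√s and placed in s distinct random rows. This is an illustrative construction only, not the construction or analysis from this work; the function names make_sparse_jl and embed, the dictionary representation of x, and the parameter values are all assumptions made for the example.

```python
import math
import random


def make_sparse_jl(m, d, s, seed=0):
    """Build an m x d sparse matrix stored column by column.

    Each column holds exactly s non-zeros, placed in s distinct rows
    chosen uniformly at random, with values +1/sqrt(s) or -1/sqrt(s).
    (Hypothetical helper for illustration.)
    """
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(s)
    columns = []
    for _ in range(d):
        rows = rng.sample(range(m), s)  # s distinct row indices
        column = [(r, scale if rng.random() < 0.5 else -scale) for r in rows]
        columns.append(column)
    return columns


def embed(columns, m, x_sparse):
    """Compute y = A x for x given as {coordinate index: value}.

    Only the non-zero coordinates of x are visited, and each touches
    s entries of A, so the loop does s * ||x||_0 updates.
    """
    y = [0.0] * m
    for j, xj in x_sparse.items():
        for r, a in columns[j]:
            y[r] += a * xj
    return y


# Example parameters (arbitrary, chosen only for the demonstration).
m, d, s = 64, 1000, 8
A = make_sparse_jl(m, d, s, seed=1)
x = {3: 2.0, 17: -1.0, 512: 0.5}  # a vector with ||x||_0 = 3
y = embed(A, m, x)
```

Storing A column by column is what makes the running time depend on ‖x‖_0 rather than on d: coordinates of x that are zero are never touched. Because every column has squared norm s · (1/s) = 1, a vector with a single non-zero coordinate keeps its norm exactly; for general vectors the norm is only preserved approximately, and how small s can be while retaining the 1 ± ε guarantee is the question the abstract addresses.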


