Near-Optimal Massively Parallel Graph Connectivity

10/11/2019
by   Soheil Behnezhad, et al.
0

Identifying the connected components of a graph, apart from being a fundamental problem with countless applications, is a key primitive for many other algorithms. In this paper, we consider this problem in parallel settings. Particularly, we focus on the Massively Parallel Computations (MPC) model, which is the standard theoretical model for modern parallel frameworks such as MapReduce, Hadoop, or Spark. We consider the truly sublinear regime of MPC for graph problems where the space per machine is n^δ for some desirably small constant δ∈ (0, 1). We present an algorithm that for graphs with diameter D in the wide range [log^ϵ n, n], takes O(log D) rounds to identify the connected components and takes O(loglog n) rounds for all other graphs. The algorithm is randomized, succeeds with high probability, does not require prior knowledge of D, and uses an optimal total space of O(m). We complement this by showing a conditional lower-bound based on the widely believed TwoCycle conjecture that Ω(log D) rounds are indeed necessary in this setting. Studying parallel connectivity algorithms received a resurgence of interest after the pioneering work of Andoni et al. [FOCS 2018] who presented an algorithm with O(log D ·loglog n) round-complexity. Our algorithm improves this result for the whole range of values of D and almost settles the problem due to the conditional lower-bound. Additionally, we show that with minimal adjustments, our algorithm can also be implemented in a variant of the (CRCW) PRAM in asymptotically the same number of rounds.

READ FULL TEXT
research
08/09/2021

Deterministic Massively Parallel Connectivity

We consider the problem of designing fundamental graph algorithms on the...
research
02/08/2023

Adaptive Massively Parallel Connectivity in Optimal Space

We study the problem of finding connected components in the Adaptive Mas...
research
05/02/2019

Log Diameter Rounds Algorithms for 2-Vertex and 2-Edge Connectivity

Many modern parallel systems, such as MapReduce, Hadoop and Spark, can b...
research
11/07/2022

Optimal Deterministic Massively Parallel Connectivity on Forests

We show fast deterministic algorithms for fundamental problems on forest...
research
05/08/2018

Massively Parallel Algorithms for Finding Well-Connected Components in Sparse Graphs

A fundamental question that shrouds the emergence of massively parallel ...
research
02/17/2020

How fast can you update your MST? (Dynamic algorithms for cluster computing)

Imagine a large graph that is being processed by a cluster of computers,...
research
04/18/2019

A Conditional Lower Bound on Graph Connectivity in MapReduce

MapReduce (and its open source implementation Hadoop) has become the de ...

Please sign up or login with your details

Forgot password? Click here to reset