Parallel Algorithms for Small Subgraph Counting

02/19/2020
by   Amartya Shankha Biswas, et al.
0

Subgraph counting is a fundamental problem in analyzing massive graphs, often studied in the context of social and complex networks. There is a rich literature on designing efficient, accurate, and scalable algorithms for this problem. In this work, we tackle this challenge and design several new algorithms for subgraph counting in the Massively Parallel Computation (MPC) model: Given a graph G over n vertices, m edges and T triangles, our first main result is an algorithm that, with high probability, outputs a (1+ε)-approximation to T, with optimal round and space complexity provided any S ≥max(√(m), n^2/m) space per machine, assuming T=Ω(√(m/n)). Our second main result is an Õ_δ(loglog n)-rounds algorithm for exactly counting the number of triangles, parametrized by the arboricity α of the input graph. The space per machine is O(n^δ) for any constant δ, and the total space is O(mα), which matches the time complexity of (combinatorial) triangle counting in the sequential model. We also prove that this result can be extended to exactly counting k-cliques for any constant k, with the same round complexity and total space O(mα^k-2). Alternatively, allowing O(α^2) space per machine, the total space requirement reduces to O(nα^2). Finally, we prove that a recent result of Bera, Pashanasangi and Seshadhri (ITCS 2020) for exactly counting all subgraphs of size at most 5, can be implemented in the MPC model in Õ_δ(√(log n)) rounds, O(n^δ) space per machine and O(mα^3) total space. Therefore, this result also exhibits the phenomenon that a time bound in the sequential model translates to a space bound in the MPC model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2023

Massively Parallel Algorithms for the Stochastic Block Model

Learning the community structure of a large-scale graph is a fundamental...
research
06/01/2023

Time and Space Optimal Massively Parallel Algorithm for the 2-Ruling Set Problem

In this work, we present a constant-round algorithm for the 2-ruling set...
research
08/14/2020

On the Hardness of Massively Parallel Computation

We investigate whether there are inherent limits of parallelization in t...
research
11/30/2020

A Near-Optimal Parallel Algorithm for Joining Binary Relations

We present a constant-round algorithm in the massively parallel computat...
research
02/08/2023

Adaptive Massively Parallel Connectivity in Optimal Space

We study the problem of finding connected components in the Adaptive Mas...
research
05/07/2022

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among...
research
05/03/2022

Differentially Private Triangle and 4-Cycle Counting in the Shuffle Model

Subgraph counting is fundamental for analyzing connection patterns or cl...

Please sign up or login with your details

Forgot password? Click here to reset