ASYMP: Fault-tolerant Mining of Massive Graphs

12/28/2017
by Eduardo Fleury, et al.

We present ASYMP, a distributed graph processing system developed for the timely analysis of graphs with trillions of edges. ASYMP has several distinguishing features, including a robust fault-tolerance mechanism, a lockless architecture that scales seamlessly to thousands of machines, and efficient data access patterns that reduce per-machine overhead. ASYMP is used to analyze the largest graphs at Google, and the graphs we consider in our empirical evaluation here are, to the best of our knowledge, the largest considered in the literature. Our experimental results show that, compared to previous graph processing frameworks at Google, ASYMP can scale to larger graphs, operate on more crowded clusters, and complete real-world graph mining tasks faster. First, we evaluate the speed of ASYMP, showing that across a diverse selection of graphs it runs Connected Components 3-50x faster than state-of-the-art implementations in MapReduce and Pregel. Then we demonstrate the scalability and parallelism of the framework: first by showing that the running time increases linearly with the size of the graphs (without changing the number of machines), and then by showing the gains in running time as the number of machines increases. Finally, we demonstrate the fault-tolerance properties of the framework, showing that inducing 50% of our machines to fail increases the running time by only 41%.
