Maiter: An Asynchronous Graph Processing Framework for Delta-based Accumulative Iterative Computation

10/16/2017
by   Yanfeng Zhang, et al.
0

Myriad of graph-based algorithms in machine learning and data mining require parsing relational data iteratively. These algorithms are implemented in a large-scale distributed environment in order to scale to massive data sets. To accelerate these large-scale graph-based iterative computations, we propose delta-based accumulative iterative computation (DAIC). Different from traditional iterative computations, which iteratively update the result based on the result from the previous iteration, DAIC updates the result by accumulating the "changes" between iterations. By DAIC, we can process only the "changes" to avoid the negligible updates. Furthermore, we can perform DAIC asynchronously to bypass the high-cost synchronous barriers in heterogeneous distributed environments. Based on the DAIC model, we design and implement an asynchronous graph processing framework, Maiter. We evaluate Maiter on local cluster as well as on Amazon EC2 Cloud. The results show that Maiter achieves as much as 60x speedup over Hadoop and outperforms other state-of-the-art frameworks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2021

Delayed Asynchronous Iterative Graph Algorithms

Iterative graph algorithms often compute intermediate values and update ...
research
04/15/2023

Layph: Making Change Propagation Constraint in Incremental Graph Processing by Layering Graph

Real-world graphs are constantly evolving, which demands updates of the ...
research
05/22/2017

An Asynchronous Distributed Framework for Large-scale Learning Based on Parameter Exchanges

In many distributed learning problems, the heterogeneous loading of comp...
research
05/31/2023

Auto-Differentiation of Relational Computations for Very Large Scale Machine Learning

The relational data model was designed to facilitate large-scale data ma...
research
12/05/2017

Analyzing Large-Scale, Distributed and Uncertain Data

The exponential growth of data in current times and the demand to gain i...
research
02/06/2018

Aggregate Graph Statistics

Collecting statistic from graph-based data is an increasingly studied to...
research
07/18/2018

Evolving Large-Scale Data Stream Analytics based on Scalable PANFIS

Many distributed machine learning frameworks have recently been built to...

Please sign up or login with your details

Forgot password? Click here to reset