In Search of a Fast and Efficient Serverless DAG Engine

10/14/2019
by Benjamin Carver, et al.

Data analytics applications written in Python can be modeled as, and compiled into, a workflow based on a directed acyclic graph (DAG), in which the nodes are fine-grained tasks and the edges are task dependencies. Such analytics workflow jobs are increasingly characterized by short, fine-grained tasks with large fan-outs. These characteristics make them well suited to a new cloud computing model called serverless computing, or Function-as-a-Service (FaaS), which has become prevalent in recent years. The auto-scaling property of serverless computing platforms accommodates short tasks and bursty workloads, while the pay-per-use billing model of serverless computing providers keeps the cost of short tasks low. In this paper, we thoroughly investigate the problem space of DAG scheduling in serverless computing. We identify and evaluate a set of techniques to make DAG schedulers serverless-aware. These techniques have been implemented in Wukong, a serverless DAG scheduler attuned to AWS Lambda. Wukong provides decentralized scheduling through a combination of static and dynamic scheduling. We present the results of an empirical study in which Wukong is applied to a range of microbenchmark and real-world DAG applications. The results demonstrate the efficacy of Wukong in minimizing the performance overhead introduced by AWS Lambda: Wukong achieves competitive performance compared to a serverful DAG scheduler, while improving the performance of real-world DAG jobs by as much as 3.1X at larger scale.
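
To make the DAG model concrete, the following is a minimal sketch, not Wukong's implementation, of a workflow whose nodes are fine-grained tasks and whose edges are task dependencies. The task names (`load`, `part0`, `reduce`) and the helper `run_dag` are hypothetical and chosen only for illustration. The sketch runs in a single local process using the Python standard library and does not model AWS Lambda or decentralized scheduling.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical workflow: one "load" task fans out to three partition tasks,
# whose results are combined by a single "reduce" task.
# Each entry maps a task name to (function, list of upstream task names).
tasks = {
    "load": (lambda: list(range(6)), []),
    "part0": (lambda xs: sum(xs[0:2]), ["load"]),
    "part1": (lambda xs: sum(xs[2:4]), ["load"]),
    "part2": (lambda xs: sum(xs[4:6]), ["load"]),
    "reduce": (lambda a, b, c: a + b + c, ["part0", "part1", "part2"]),
}


def run_dag(tasks):
    """Run each task after all of its upstream dependencies have finished."""
    # TopologicalSorter expects a mapping from node to its predecessors.
    graph = {name: set(deps) for name, (_, deps) in tasks.items()}
    results = {}
    for name in TopologicalSorter(graph).static_order():
        func, deps = tasks[name]
        # Pass upstream results to the task in the declared dependency order.
        results[name] = func(*(results[d] for d in deps))
    return results


results = run_dag(tasks)
print(results["reduce"])  # prints 15
```

The three partition tasks depend only on `load`, so they are the kind of short, independent fan-out work that a serverless platform could in principle run in parallel; this sketch simply executes them one after another in a valid topological order.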


