Performance evaluation of job schedulers on Hadoop YARN

08/24/2018
by   Jia-Chun Lin, et al.

To address Hadoop's limitations in scalability, resource sharing, and application support, the open-source community proposed Yet Another Resource Negotiator (YARN), the next generation of Hadoop's compute platform, which separates resource management from the programming model. This separation enables various application types to run on YARN in parallel. To achieve fair resource sharing and high resource utilization, YARN provides the capacity scheduler and the fair scheduler. However, the performance impact of the two schedulers is unclear when mixed applications run on a YARN cluster. Therefore, in this paper, we study four scheduling-policy combinations (SPCs for short) derived from the two schedulers and evaluate them in extensive scenarios that consider not only four application types but also three different queue structures for organizing applications. The experimental results help YARN managers understand how different SPCs and queue structures affect mixed applications, and select a proper SPC and an appropriate queue structure to achieve better application execution performance.

research
08/28/2020

SAF: Simulated Annealing Fair Scheduling for Hadoop Yarn Clusters

Apache introduced YARN as the next generation of the Hadoop framework, p...
research
05/21/2019

Tromino: Demand and DRF Aware Multi-Tenant Queue Manager for Apache Mesos Cluster

Apache Mesos, a two-level resource scheduler, provides resource sharing ...
research
05/22/2019

Two stage cluster for resource optimization with Apache Mesos

As resource estimation for jobs is difficult, users often overestimate t...
research
01/25/2022

Learning Resource Allocation Policies from Observational Data with an Application to Homeless Services Delivery

We study the problem of learning, from observational data, fair and inte...
research
03/04/2019

Resource-sharing Policy in Multi-tenant Scientific Workflow as a Service Platform

Increasing adoption of scientific workflows in the community has urged f...
research
07/30/2019

Optimal Dynamic Multi-Resource Management in Earth Observation Oriented Space Information Networks

Space information network (SIN) is an innovative networking architecture...
research
06/20/2023

Fine-grained Policy-driven I/O Sharing for Burst Buffers

A burst buffer is a common method to bridge the performance gap between ...
