Tromino: Demand and DRF Aware Multi-Tenant Queue Manager for Apache Mesos Cluster

05/21/2019
by   Pankaj Saha, et al.
0

Apache Mesos, a two-level resource scheduler, provides resource sharing across multiple users in a multi-tenant cluster environment. Computational resources (i.e., CPU, memory, disk, etc. ) are distributed according to the Dominant Resource Fairness (DRF) policy. Mesos frameworks (users) receive resources based on their current usage and are responsible for scheduling their tasks within the allocation. We have observed that multiple frameworks can cause fairness imbalance in a multiuser environment. For example, a greedy framework consuming more than its fair share of resources can deny resource fairness to others. The user with the least Dominant Share is considered first by the DRF module to get its resource allocation. However, the default DRF implementation, in Apache Mesos' Master allocation module, does not consider the overall resource demands of the tasks in the queue for each user/framework. This lack of awareness can result in users without any pending task receiving more resource offers while users with a queue of pending tasks starve due to their high dominant shares. We have developed a policy-driven queue manager, Tromino, for an Apache Mesos cluster where tasks for individual frameworks can be scheduled based on each framework's overall resource demands and current resource consumption. Dominant Share and demand awareness of Tromino and scheduling based on these attributes can reduce (1) the impact of unfairness due to a framework specific configuration, and (2) unfair waiting time due to higher resource demand in a pending task queue. In the best case, Tromino can significantly reduce the average waiting time of a framework by using the proposed Demand-DRF aware policy.

READ FULL TEXT

page 1

page 3

page 8

research
05/21/2019

Exploring the Fairness and Resource Distribution in an Apache Mesos Environment

Apache Mesos, a cluster-wide resource manager, is widely deployed in mas...
research
12/29/2017

An Efficient and Fair Multi-Resource Allocation Mechanism for Heterogeneous Servers

Efficient and fair allocation of multiple types of resources is a crucia...
research
01/25/2022

Learning Resource Allocation Policies from Observational Data with an Application to Homeless Services Delivery

We study the problem of learning, from observational data, fair and inte...
research
08/24/2018

Performance evaluation of job schedulers on Hadoop YARN

To solve the limitation of Hadoop on scalability, resource sharing, and ...
research
01/29/2021

Fair Resource Allocation for Demands with Sharp Lower Tail Inequalities

We consider a fairness problem in resource allocation where multiple gro...
research
01/03/2014

A Framework for Creating a Distributed Rendering Environment on the Compute Clusters

This paper discusses the deployment of existing render farm manager in a...
research
12/15/2010

Customer Appeasement Scheduling

Almost all of the current process scheduling algorithms which are used i...

Please sign up or login with your details

Forgot password? Click here to reset