The Supermarket Model with Known and Predicted Service Times

05/23/2019
by   Michael Mitzenmacher, et al.
0

The supermarket model typically refers to a system with a large number of queues, where arriving customers choose d queues at random and join the queue with fewest customers. The supermarket model demonstrates the power of even small amounts of choice, as compared to simply joining a queue chosen uniformly at random, for load balancing systems. In this work we perform simulation-based studies to consider variations where service times for a customer are predicted, as might be done in modern settings using machine learning techniques or related mechanisms. To begin, we start by considering the baseline where service times are known. We find that this allows for significant improvements. In particular, not only can the queue being joined be chosen based on the total work at the queue instead of the number of jobs, but also the jobs in the queue can be served using strategies that take advantage of the service times such as shortest job first or shortest remaining processing time. Such strategies greatly improve performance under high load. We then examine the impact of using predictions in place of true service times. Our main takeaway is that using even seemingly weak predictions of service times can yield significant benefits over blind First In First Out queueing in this context. However, some care must be taken when using predicted service time information to both choose a queue and order elements for service within a queue; while in many cases using the information for both choosing and ordering is beneficial, in many of our simulation settings we find that simply using the number of jobs to choose a queue is better when using predicted service times to order jobs in a queue. Our study leaves many natural open questions for further work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2019

Scheduling with Predictions and the Price of Misprediction

In many traditional job scheduling settings, it is assumed that one know...
research
11/16/2020

Improved Load Balancing in Large Scale Systems using Attained Service Time Reporting

Our interest lies in load balancing jobs in large scale systems consisti...
research
08/19/2020

End-to-End Predictions-Based Resource Management Framework for Supercomputer Jobs

Job submissions of parallel applications to production supercomputer sys...
research
04/16/2021

The generalized join the shortest orbit queue system: Stability, exact tail asymptotics and stationary approximations

We introduce the generalized join the shortest queue model with retrials...
research
04/28/2022

Predicting batch queue job wait times for informed scheduling of urgent HPC workloads

There is increasing interest in the use of HPC machines for urgent workl...
research
11/01/2022

Towards Maximizing Nonlinear Delay Sensitive Rewards in Queuing Systems

We consider maximizing the long-term average reward in a single server q...
research
02/23/2022

The Tiny-Tasks Granularity Trade-Off: Balancing overhead vs. performance in parallel systems

Models of parallel processing systems typically assume that one has l wo...

Please sign up or login with your details

Forgot password? Click here to reset