A Framework for dynamically meeting performance objectives on a service mesh

06/25/2023
by   Forough Shahab Samani, et al.
0

We present a framework for achieving end-to-end management objectives for multiple services that concurrently execute on a service mesh. We apply reinforcement learning (RL) techniques to train an agent that periodically performs control actions to reallocate resources. We develop and evaluate the framework using a laboratory testbed where we run information and computing services on a service mesh, supported by the Istio and Kubernetes platforms. We investigate different management objectives that include end-to-end delay bounds on service requests, throughput objectives, cost-related objectives, and service differentiation. We compute the control policies on a simulator rather than on the testbed, which speeds up the training time by orders of magnitude for the scenarios we study. Our proposed framework is novel in that it advocates a top-down approach whereby the management objectives are defined first and then mapped onto the available control actions. It allows us to execute several types of control actions simultaneously. By first learning the system model and the operating region from testbed traces, we can train the agent for different management objectives in parallel.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/08/2022

Dynamically meeting performance objectives for multiple services on a service mesh

We present a framework that lets a service provider achieve end-to-end m...
research
08/21/2020

Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems

Ensuring quality of service (QoS) guarantees in service systems is a cha...
research
03/02/2023

Domain Adaptation of Reinforcement Learning Agents based on Network Service Proximity

The dynamic and evolutionary nature of service requirements in wireless ...
research
01/12/2021

Queue-Learning: A Reinforcement Learning Approach for Providing Quality of Service

End-to-end delay is a critical attribute of quality of service (QoS) in ...
research
07/14/2021

QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning

Fifth-generation (5G) New Radio (NR) cellular networks support a wide ra...
research
05/17/2017

Identification and Off-Policy Learning of Multiple Objectives Using Adaptive Clustering

In this work, we present a methodology that enables an agent to make eff...
research
08/19/2020

FIRM: An Intelligent Fine-Grained Resource Management Framework for SLO-Oriented Microservices

Modern user-facing latency-sensitive web services include numerous distr...

Please sign up or login with your details

Forgot password? Click here to reset