Learning Distributed and Fair Policies for Network Load Balancing as Markov Potential Game

06/03/2022
by   Zhiyuan Yao, et al.
0

This paper investigates the network load balancing problem in data centers (DCs) where multiple load balancers (LBs) are deployed, using the multi-agent reinforcement learning (MARL) framework. The challenges of this problem consist of the heterogeneous processing architecture and dynamic environments, as well as limited and partial observability of each LB agent in distributed networking systems, which can largely degrade the performance of in-production load balancing algorithms in real-world setups. Centralised-training-decentralised-execution (CTDE) RL scheme has been proposed to improve MARL performance, yet it incurs – especially in distributed networking systems, which prefer distributed and plug-and-play design scheme – additional communication and management overhead among agents. We formulate the multi-agent load balancing problem as a Markov potential game, with a carefully and properly designed workload distribution fairness as the potential function. A fully distributed MARL algorithm is proposed to approximate the Nash equilibrium of the game. Experimental evaluations involve both an event-driven simulator and real-world system, where the proposed MARL load balancing algorithm shows close-to-optimal performance in simulations, and superior results over in-production LBs in the real-world system.

READ FULL TEXT

page 17

page 18

research
01/27/2022

Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

This paper presents the network load balancing problem, a challenging re...
research
10/29/2021

Reinforced Workload Distribution Fairness

Network load balancers are central components in data centers, that dist...
research
03/14/2023

Multi-agent Attention Actor-Critic Algorithm for Load Balancing in Cellular Networks

In cellular networks, User Equipment (UE) handoff from one Base Station ...
research
10/27/2021

Towards Intelligent Load Balancing in Data Centers

Network load balancers are important components in data centers to provi...
research
12/07/2021

Attention-Based Model and Deep Reinforcement Learning for Distribution of Event Processing Tasks

Event processing is the cornerstone of the dynamic and responsive Intern...
research
05/31/2020

Centralized and Decentralized Non-Cooperative Load-Balancing Games among Competing Cloudlets

Edge computing servers like cloudlets from different service providers t...
research
08/24/2022

Efficient Data-Driven Network Functions

Cloud environments require dynamic and adaptive networking policies. It ...

Please sign up or login with your details

Forgot password? Click here to reset