Multi-Agent Reinforcement Learning via Mean Field Control: Common Noise, Major Agents and Approximation Properties

03/19/2023
by Kai Cui, et al.

Recently, mean field control (MFC) has provided a tractable and theoretically grounded approach to otherwise difficult cooperative multi-agent control. However, the assumption of many independent, homogeneous agents may be too stringent in practice. In this work, we propose major-minor mean field control (M3FC), a novel discrete-time generalization of Markov decision processes and MFC to both many minor agents and potentially complex major agents. In contrast to deterministic MFC, M3FC allows for stochastic minor agent distributions with strong correlation between minor agents through the major agent state, which can model arbitrary problem details not bound to any agent. Theoretically, we give rigorous approximation properties with novel proofs for both M3FC and existing MFC models in the finite multi-agent problem, together with a dynamic programming principle for solving such problems. In the infinite-horizon discounted case, existence of an optimal stationary policy follows. Algorithmically, we propose the major-minor mean field proximal policy optimization algorithm (M3FPPO) as a novel multi-agent reinforcement learning algorithm and demonstrate its success in illustrative M3FC-type problems.
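
To make the approximation idea in the abstract concrete, the following is a minimal toy sketch and not the paper's model or the M3FPPO algorithm. It assumes binary minor-agent states, a fixed (uncontrolled) behavior, and a binary major state that flips exogenously, so it acts as a shared source of randomness for all minor agents. The kernel coefficients and the names `minor_kernel`, `mean_field_step`, `finite_step`, `major_step` and `simulate` are hypothetical. The sketch compares the empirical distribution of N minor agents with the limiting mean field update driven by the same major state path.

```python
import random


def minor_kernel(x, mu1, g):
    """Hypothetical probability that a minor agent in state x moves to state 1.

    mu1 is the fraction of minor agents in state 1, g is the major state (0 or 1).
    The coefficients are illustrative assumptions; the value stays in [0.1, 0.9].
    """
    return 0.1 + 0.4 * x + 0.2 * mu1 + 0.2 * g


def mean_field_step(mu1, g):
    """Limiting update of the fraction in state 1, conditional on major state g."""
    return (1.0 - mu1) * minor_kernel(0, mu1, g) + mu1 * minor_kernel(1, mu1, g)


def finite_step(states, g, rng):
    """One step of the finite system: each agent samples its next state."""
    mu1 = sum(states) / len(states)
    return [1 if rng.random() < minor_kernel(x, mu1, g) else 0 for x in states]


def major_step(g, rng, flip_prob=0.2):
    """Assumed exogenous major dynamics: flip the major state with fixed probability."""
    return 1 - g if rng.random() < flip_prob else g


def simulate(n_agents, horizon, seed=0):
    """Return the largest gap between empirical and limiting fractions in state 1."""
    rng = random.Random(seed)
    states = [0] * n_agents  # all minor agents start in state 0
    mu1, g, gap = 0.0, 0, 0.0
    for _ in range(horizon):
        # The same major state g drives both the finite system and the limit,
        # so the mean field is random only through the major state path.
        states = finite_step(states, g, rng)
        mu1 = mean_field_step(mu1, g)
        gap = max(gap, abs(sum(states) / n_agents - mu1))
        g = major_step(g, rng)
    return gap


if __name__ == "__main__":
    for n in (10, 100, 10000):
        print(n, simulate(n, horizon=20))
```

In this toy setting the gap tends to shrink as the number of minor agents grows, while the mean field itself remains random because it depends on the major state path. A controlled major agent and a learned policy, as in the abstract, would replace the fixed kernel and the exogenous flip.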

research
04/30/2021

Discrete-Time Mean Field Control with Environment States

Multi-agent reinforcement learning methods have shown remarkable potenti...
research
09/09/2021

On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)

Mean field control (MFC) is an effective way to mitigate the curse of di...
research
03/13/2021

Mean Field Behaviour of Collaborative Multi-Agent Foragers

Collaborative multi-agent robotic systems where agents coordinate by mod...
research
09/12/2022

Mean-Field Control Approach to Decentralized Stochastic Control with Finite-Dimensional Memories

Decentralized stochastic control (DSC) considers the optimal control pro...
research
04/24/2020

Decentralized linear quadratic systems with major and minor agents and non-Gaussian noise

We consider a decentralized linear quadratic system with a major agent a...
research
10/04/2022

Robust feedback stabilization of interacting multi-agent systems under uncertainty

We consider control strategies for large-scale interacting agent systems...
research
03/06/2022

Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation

Multi-agent settings remain a fundamental challenge in the reinforcement...
