Learning to Share in Multi-Agent Reinforcement Learning

12/16/2021
by   Yuxuan Yi, et al.
0

In this paper, we study the problem of networked multi-agent reinforcement learning (MARL), where a number of agents are deployed as a partially connected network and each interacts only with nearby agents. Networked MARL requires all agents make decision in a decentralized manner to optimize a global objective with restricted communication between neighbors over the network. Inspired by the fact that sharing plays a key role in human's learning of cooperation, we propose LToS, a hierarchically decentralized MARL framework that enables agents to learn to dynamically share reward with neighbors so as to encourage agents to cooperate on the global objective. For each agent, the high-level policy learns how to share reward with neighbors to decompose the global objective, while the low-level policy learns to optimize local objective induced by the high-level policies in the neighborhood. The two policies form a bi-level optimization and learn alternately. We empirically demonstrate that LToS outperforms existing methods in both social dilemma and networked MARL scenario.

READ FULL TEXT
research
10/15/2020

Multi-Agent Trust Region Policy Optimization

We extend trust region policy optimization (TRPO) to multi-agent reinfor...
research
04/25/2023

SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning

Spatial information is essential in various fields. How to explicitly mo...
research
10/17/2016

Decentralized Collaborative Learning of Personalized Models over Networks

We consider a set of learning agents in a collaborative peer-to-peer net...
research
10/31/2019

Learning Fairness in Multi-Agent Systems

Fairness is essential for human society, contributing to stability and p...
research
09/30/2021

Decentralized Graph-Based Multi-Agent Reinforcement Learning Using Reward Machines

In multi-agent reinforcement learning (MARL), it is challenging for a co...
research
02/28/2023

IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

Achieving and maintaining cooperation between agents to accomplish a com...

Please sign up or login with your details

Forgot password? Click here to reset