Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving Behaviors

11/17/2018
by   Meha Kaushik, et al.

Multi-agent learning provides a potential framework for learning and simulating traffic behaviors. This paper proposes a novel architecture to learn multiple driving behaviors in a traffic scenario. The proposed architecture can learn multiple behaviors independently as well as simultaneously. We take advantage of the homogeneity of the agents and learn in a parameter-sharing paradigm. To further speed up training, asynchronous updates are incorporated into the architecture. While learning different behaviors simultaneously, the framework was also able to learn cooperation between the agents without any explicit communication. We applied this framework to learn two important driving behaviors: 1) lane-keeping and 2) overtaking. Results indicate faster convergence and the learning of a more generic behavior that scales to any number of agents. Compared with existing approaches, our results indicate equal, and in some cases better, performance.
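To make the parameter-sharing idea concrete, here is a minimal Python sketch. It is not the paper's architecture: the linear policy, the squared-error loss standing in for a reinforcement learning objective, and all names (`SharedPolicy`, `agent_gradient`, `train`, `true_w`) are assumptions chosen for illustration. It shows only the core pattern: homogeneous agents read from and write to a single set of parameters, and each agent applies its update as soon as it is computed, which loosely imitates asynchronous updates in a sequential simulation.

```python
import random


class SharedPolicy:
    """A single weight vector used by every agent (hypothetical toy linear policy)."""

    def __init__(self, obs_dim, seed=0):
        rng = random.Random(seed)
        self.weights = [rng.uniform(-0.1, 0.1) for _ in range(obs_dim)]

    def act(self, obs):
        # Linear action: dot product of the shared weights and the observation.
        return sum(w * x for w, x in zip(self.weights, obs))

    def apply_gradient(self, grad, lr=0.05):
        # One gradient-descent step on the shared weights.
        self.weights = [w - lr * g for w, g in zip(self.weights, grad)]


def agent_gradient(policy, obs, target):
    """Gradient of 0.5 * (act(obs) - target)**2 with respect to the shared weights.

    Assumption: a supervised loss stands in for a policy-gradient signal.
    """
    error = policy.act(obs) - target
    return [error * x for x in obs]


def train(num_agents=4, steps=500, true_w=(0.5, -1.0, 2.0), seed=1):
    """Let several agents update one shared policy from their own observations."""
    rng = random.Random(seed)
    policy = SharedPolicy(obs_dim=len(true_w))
    for _ in range(steps):
        for _agent in range(num_agents):
            # Each agent sees its own observation of the (toy) environment.
            obs = [rng.uniform(-1.0, 1.0) for _ in true_w]
            target = sum(w * x for w, x in zip(true_w, obs))
            # The update is applied immediately, without waiting for other agents.
            policy.apply_gradient(agent_gradient(policy, obs, target))
    return policy, list(true_w)


if __name__ == "__main__":
    policy, true_w = train()
    print("learned weights:", [round(w, 3) for w in policy.weights])
    print("target weights: ", true_w)
```

Because every agent contributes experience to the same weights, adding agents increases the number of updates per environment step without adding parameters, which is one plausible reading of why a shared policy can scale to any number of agents.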

research
07/12/2022

Reward-Sharing Relational Networks in Multi-Agent Reinforcement Learning as a Framework for Emergent Behavior

In this work, we integrate `social' interactions into the MARL setup thr...
research
03/02/2023

Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning

Handling the problem of scalability is one of the essential issues for m...
research
01/29/2020

Variational Autoencoders for Opponent Modeling in Multi-Agent Systems

Multi-agent systems exhibit complex behaviors that emanate from the inte...
research
12/14/2020

SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning

A characteristic of reinforcement learning is the ability to develop unf...
research
09/29/2021

Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning

In multi-agent deep reinforcement learning, extracting sufficient and co...
research
10/26/2021

Learning to Simulate Self-Driven Particles System with Coordinated Policy Optimization

Self-Driven Particles (SDP) describe a category of multi-agent systems c...
research
03/22/2021

Learning to Robustly Negotiate Bi-Directional Lane Usage in High-Conflict Driving Scenarios

Recently, autonomous driving has made substantial progress in addressing...
