TD-Regularized Actor-Critic Methods

12/19/2018
by Simone Parisi, et al.

Actor-critic methods can achieve strong performance on difficult reinforcement learning problems, but they are also prone to instability. This is partly due to the interaction between the actor and the critic during learning: an inaccurate step taken by one of them can adversely affect the other and destabilize learning. To avoid such issues, we propose to regularize the learning objective of the actor by penalizing the temporal difference (TD) error of the critic. This improves stability by avoiding large steps in the actor update whenever the critic is highly inaccurate. The resulting method, which we call the TD-regularized actor-critic method, is a simple plug-and-play approach to improving the stability and overall performance of actor-critic methods. Evaluations on standard benchmarks confirm this.
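
To make the idea of penalizing the critic's TD error concrete, here is a minimal sketch in plain Python. It is not taken from the paper's text: the one-step TD error, the squared-error penalty, the function names, and the parameters `gamma` (discount factor) and `eta` (regularization weight) are all assumptions chosen for illustration. It only evaluates a scalar objective on a small batch; a real implementation would differentiate this quantity with respect to the actor's parameters.

```python
def td_errors(rewards, q_values, next_q_values, gamma=0.99):
    """One-step TD errors, assumed here as delta = r + gamma * Q(s', a') - Q(s, a)."""
    return [r + gamma * q_next - q
            for r, q, q_next in zip(rewards, q_values, next_q_values)]


def td_regularized_objective(q_values, deltas, eta=0.1):
    """Mean critic value minus eta times the mean squared TD error.

    eta is a hypothetical regularization weight; eta = 0 recovers the
    unregularized actor objective in this sketch.
    """
    n = len(q_values)
    mean_q = sum(q_values) / n
    mean_sq_delta = sum(d * d for d in deltas) / n
    return mean_q - eta * mean_sq_delta


# Toy batch of three transitions (made-up numbers, for illustration only).
rewards = [1.0, 0.0, 1.0]
q_values = [2.0, 1.0, 3.0]        # critic estimates Q(s, a)
next_q_values = [2.0, 1.0, 2.0]   # critic estimates Q(s', a')

deltas = td_errors(rewards, q_values, next_q_values, gamma=1.0)
plain = td_regularized_objective(q_values, deltas, eta=0.0)
regularized = td_regularized_objective(q_values, deltas, eta=0.5)
```

In this sketch the penalty lowers the objective only where the critic's TD error is nonzero, so an actor update that pushes toward regions where the critic is inaccurate is discouraged, which matches the intuition stated in the abstract.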

research
10/24/2022

AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay

Actor learning and critic learning are two components of the outstanding...
research
07/12/2021

Cautious Actor-Critic

The oscillating performance of off-policy learning and persisting errors...
research
02/23/2021

Good Actors can come in Smaller Sizes: A Case Study on the Value of Actor-Critic Asymmetry

Actors and critics in actor-critic reinforcement learning algorithms are...
research
10/02/2020

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

We investigate the discounting mismatch in actor-critic algorithm implem...
research
12/29/2017

Boosting the Actor with Dual Critic

This paper proposes a new actor-critic-style algorithm called Dual Actor...
research
12/04/2017

Hierarchical Actor-Critic

We present a novel approach to hierarchical reinforcement learning calle...
research
10/06/2016

Connecting Generative Adversarial Networks and Actor-Critic Methods

Both generative adversarial networks (GAN) in unsupervised learning and ...
