PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning

06/08/2021
by   Tao Yu, et al.
0

Learning good feature representations is important for deep reinforcement learning (RL). However, with limited experience, RL often suffers from data inefficiency for training. For un-experienced or less-experienced trajectories (i.e., state-action sequences), the lack of data limits the use of them for better feature learning. In this work, we propose a novel method, dubbed PlayVirtual, which augments cycle-consistent virtual trajectories to enhance the data efficiency for RL feature representation learning. Specifically, PlayVirtual predicts future states based on the current state and action by a dynamics model and then predicts the previous states by a backward dynamics model, which forms a trajectory cycle. Based on this, we augment the actions to generate a large amount of virtual state-action trajectories. Being free of groudtruth state supervision, we enforce a trajectory to meet the cycle consistency constraint, which can significantly enhance the data efficiency. We validate the effectiveness of our designs on the Atari and DeepMind Control Suite benchmarks. Our method outperforms the current state-of-the-art methods by a large margin on both benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2022

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Deep reinforcement learning (RL) algorithms suffer severe performance de...
research
11/24/2021

Learning State Representations via Retracing in Reinforcement Learning

We propose learning via retracing, a novel self-supervised approach for ...
research
01/18/2022

Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning

Learning informative representations from image-based observations is of...
research
08/25/2019

Dynamics-aware Embeddings

In this paper we consider self-supervised representation learning to imp...
research
01/31/2023

CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning

This paper addresses the problem of visual feature representation learni...
research
08/26/2022

Visual processing in context of reinforcement learning

Although deep reinforcement learning (RL) has recently enjoyed many succ...
research
04/02/2018

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

In many environments only a tiny subset of all states yield high reward....

Please sign up or login with your details

Forgot password? Click here to reset