Robo-PlaNet: Learning to Poke in a Day

Recently, the Deep Planning Network (PlaNet) approach was introduced as a model-based reinforcement learning method that learns environment dynamics directly from pixel observations. This architecture is useful for learning tasks in which either the agent does not have access to meaningful states (like position/velocity of robotic joints) or where the observed states significantly deviate from the physical state of the agent (which is commonly the case in low-cost robots in the form of backlash or noisy joint readings). PlaNet, by design, interleaves phases of training the dynamics model with phases of collecting more data on the target environment, leading to long training times. In this work, we introduce Robo-PlaNet, an asynchronous version of PlaNet. This algorithm consistently reaches higher performance in the same amount of time, which we demonstrate in both a simulated and a real robotic experiment.

READ FULL TEXT

page 1

page 2

research
11/12/2018

Learning Latent Dynamics for Planning from Pixels

Planning has been very successful for control tasks with known environme...
research
03/23/2022

Asynchronous Reinforcement Learning for Real-Time Control of Physical Robots

An oft-ignored challenge of real-world reinforcement learning is that th...
research
10/28/2019

Asynchronous Methods for Model-Based Reinforcement Learning

Significant progress has been made in the area of model-based reinforcem...
research
10/16/2012

Learning STRIPS Operators from Noisy and Incomplete Observations

Agents learning to act autonomously in real-world domains must acquire a...
research
02/21/2017

Towards a Common Implementation of Reinforcement Learning for Multiple Robotic Tasks

Mobile robots are increasingly being employed for performing complex tas...
research
10/09/2019

Ctrl-Z: Recovering from Instability in Reinforcement Learning

When learning behavior, training data is often generated by the learner ...
research
10/03/2020

Episodic Memory for Learning Subjective-Timescale Models

In model-based learning, an agent's model is commonly defined over trans...

Please sign up or login with your details

Forgot password? Click here to reset