Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods

02/28/2018
by   Deirdre Quillen, et al.
0

In this paper, we explore deep reinforcement learning algorithms for vision-based robotic grasping. Model-free deep reinforcement learning (RL) has been successfully applied to a range of challenging environments, but the proliferation of algorithms makes it difficult to discern which particular approach would be best suited for a rich, diverse task like grasping. To answer this question, we propose a simulated benchmark for robotic grasping that emphasizes off-policy learning and generalization to unseen objects. Off-policy learning enables utilization of grasping data over a wide variety of objects, and diversity is important to enable the method to generalize to new objects that were not seen during training. We evaluate the benchmark tasks against a variety of Q-function estimation methods, a method previously proposed for robotic grasping with deep neural network models, and a novel approach based on a combination of Monte Carlo return estimation and an off-policy correction. Our results indicate that several simple methods provide a surprisingly strong competitor to popular algorithms such as double Q-learning, and our analysis of stability sheds light on the relative tradeoffs between the algorithms.

READ FULL TEXT

page 3

page 7

research
07/02/2020

Towards Generalization and Data Efficient Learning of Deep Robotic Grasping

Deep reinforcement learning (DRL) has been proven to be a powerful parad...
research
08/01/2022

Learning to Grasp on the Moon from 3D Octree Observations with Deep Reinforcement Learning

Extraterrestrial rovers with a general-purpose robotic arm have many pot...
research
07/16/2020

Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators

Developing personal robots that can perform a diverse range of manipulat...
research
04/26/2021

End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learning

State-of-the-art human-in-the-loop robot grasping is hugely suffered by ...
research
04/13/2020

Thinking While Moving: Deep Reinforcement Learning with Concurrent Control

We study reinforcement learning in settings where sampling an action fro...
research
05/17/2019

REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning

Standardized evaluation measures have aided in the progress of machine l...

Please sign up or login with your details

Forgot password? Click here to reset