Polybot: Training One Policy Across Robots While Embracing Variability

07/07/2023
by   Jonathan Yang, et al.
0

Reusing large datasets is crucial to scale vision-based robotic manipulators to everyday scenarios due to the high cost of collecting robotic datasets. However, robotic platforms possess varying control schemes, camera viewpoints, kinematic configurations, and end-effector morphologies, posing significant challenges when transferring manipulation skills from one platform to another. To tackle this problem, we propose a set of key design decisions to train a single policy for deployment on multiple robotic platforms. Our framework first aligns the observation and action spaces of our policy across embodiments via utilizing wrist cameras and a unified, but modular codebase. To bridge the remaining domain shift, we align our policy's internal representations across embodiments through contrastive learning. We evaluate our method on a dataset collected over 60 hours spanning 6 tasks and 3 robots with varying joint configurations and sizes: the WidowX 250S, the Franka Emika Panda, and the Sawyer. Our results demonstrate significant improvements in success rate and sample efficiency for our policy when using new task data collected on a different robot, validating our proposed design decisions. More details and videos can be found on our anonymized project website: https://sites.google.com/view/polybot-multirobot

READ FULL TEXT

page 2

page 4

page 6

page 13

page 16

research
02/05/2023

Multi-View Masked World Models for Visual Robotic Manipulation

Visual robotic manipulation research and applications often use multiple...
research
04/27/2020

The Ingredients of Real-World Robotic Reinforcement Learning

The success of reinforcement learning for real world robotics has been, ...
research
10/07/2019

DexPilot: Vision Based Teleoperation of Dexterous Robotic Hand-Arm System

Teleoperation offers the possibility of imparting robotic systems with s...
research
07/08/2023

Robust Learning-Based Incipient Slip Detection using the PapillArray Optical Tactile Sensor for Improved Robotic Gripping

The ability to detect slip, particularly incipient slip, enables robotic...
research
10/27/2020

COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning

Reinforcement learning has been applied to a wide variety of robotics pr...
research
06/29/2021

Survivable Robotic Control through Guided Bayesian Policy Search with Deep Reinforcement Learning

Many robot manipulation skills can be represented with deterministic cha...
research
08/07/2017

GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images

We tackle the problem of learning robotic sensorimotor control policies ...

Please sign up or login with your details

Forgot password? Click here to reset