Learning Navigation Skills for Legged Robots with Learned Robot Embeddings

11/24/2020
by   Joanne Truong, et al.
0

Navigation policies are commonly learned on idealized cylinder agents in simulation, without modelling complex dynamics, like contact dynamics, arising from the interaction between the robot and the environment. Such policies perform poorly when deployed on complex and dynamic robots, such as legged robots. In this work, we learn hierarchical navigation policies that account for the low-level dynamics of legged robots, such as maximum speed, slipping, and achieve good performance at navigating cluttered indoor environments. Once such a policy is learned on one legged robot, it does not directly generalize to a different robot due to dynamical differences, which increases the cost of learning such a policy on new robots. To overcome this challenge, we learn dynamics-aware navigation policies across multiple robots with robot-specific embeddings, which enable generalization to new unseen robots. We train our policies across three legged robots - 2 quadrupeds (A1, AlienGo) and a hexapod (Daisy). At test time, we study the performance of our learned policy on two new legged robots (Laikago, 4-legged Daisy) and show that our learned policy can sample-efficiently generalize to previously unseen robots.

READ FULL TEXT

page 1

page 2

page 4

page 6

research
07/11/2018

Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

Model-free reinforcement learning has recently been shown to be effectiv...
research
12/05/2022

Learning Representations that Enable Generalization in Assistive Tasks

Recent work in sim2real has successfully enabled robots to act in physic...
research
04/21/2023

Spatial-Language Attention Policies for Efficient Robot Learning

We investigate how to build and train spatial representations for robot ...
research
03/14/2023

RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback

Reinforcement learning-based policies for continuous control robotic nav...
research
04/15/2023

BVIP Guiding System with Adaptability to Individual Differences

Guiding robots can not only detect close-range obstacles like other guid...
research
03/23/2019

Long Range Neural Navigation Policies for the Real World

Learned Neural Network based policies have shown promising results for r...
research
04/24/2023

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Learning algorithms, like Quality-Diversity (QD), can be used to acquire...

Please sign up or login with your details

Forgot password? Click here to reset