RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback

03/14/2023
by Souradip Chakraborty, et al.

Reinforcement learning-based policies for continuous control robotic navigation tasks often fail to adapt to changes in the environment during real-time deployment, which may result in catastrophic failures. To address this limitation, we propose a novel approach called RE-MOVE (REquest help and MOVE on), which uses language-based feedback to adjust trained policies to real-time changes in the environment. In this work, we enable the trained policy to decide when to ask for feedback and how to incorporate that feedback. RE-MOVE uses epistemic uncertainty to determine the optimal time to request feedback from humans and uses language-based feedback for real-time adaptation. We perform extensive synthetic and real-world evaluations to demonstrate the benefits of our proposed approach in several test-time dynamic navigation scenarios. Our approach enables robots to learn from human feedback and adapt to previously unseen adversarial situations.
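The idea of using epistemic uncertainty to decide when to ask for help can be illustrated with a minimal sketch. The abstract does not say how RE-MOVE estimates uncertainty, so the code below assumes one common proxy: disagreement (variance) among an ensemble of value estimates for the current state. The function names, the sample numbers, and the threshold of 0.5 are hypothetical and chosen only for illustration.

```python
import statistics


def epistemic_uncertainty(ensemble_predictions):
    """Assumed proxy: population variance across ensemble value estimates."""
    return statistics.pvariance(ensemble_predictions)


def should_request_feedback(ensemble_predictions, threshold=0.5):
    """Ask a human for feedback only when ensemble disagreement is high.

    The threshold is a hypothetical tuning parameter, not a value
    taken from the paper.
    """
    return epistemic_uncertainty(ensemble_predictions) > threshold


# Familiar state: ensemble members roughly agree, so the policy moves on.
agree = [1.0, 1.1, 0.9, 1.0]
# Unseen situation: ensemble members disagree, so the policy requests help.
disagree = [0.1, 2.0, -1.5, 3.0]

print(should_request_feedback(agree))
print(should_request_feedback(disagree))
```

In this sketch the request is gated on a single scalar, so feedback is asked for only in states where the estimates diverge rather than at every step.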


