Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

05/19/2020
by   Felix Hill, et al.
0

Recent work has described neural-network-based agents that are trained with reinforcement learning (RL) to execute language-like commands in simulated worlds, as a step towards an intelligent agent or robot that can be instructed by human users. However, the optimisation of multi-goal motor policies via deep RL from scratch requires many episodes of experience. Consequently, instruction-following with deep RL typically involves language generated from templates (by an environment simulator), which does not reflect the varied or ambiguous expressions of real users. Here, we propose a conceptually simple method for training instruction-following agents with deep RL that are robust to natural human instructions. By applying our method with a state-of-the-art pre-trained text-based language model (BERT), on tasks requiring agents to identify and position everyday objects relative to other objects in a naturalistic 3D simulated room, we demonstrate substantially-above-chance zero-shot transfer from synthetic template commands to natural instructions given by humans. Our approach is a general recipe for training any deep RL-based system to interface with human users, and bridges the gap between two research directions of notable recent success: agent-centric motor behavior and text-based representation learning.

READ FULL TEXT

page 2

page 15

page 16

research
11/08/2022

Learning to Follow Instructions in Text-Based Games

Text-based games present a unique class of sequential decision making pr...
research
09/03/2020

Grounded Language Learning Fast and Slow

Recent work has shown that large text-based neural language models, trai...
research
06/17/2020

Converting Biomechanical Models from OpenSim to MuJoCo

OpenSim is a widely used biomechanics simulator with several anatomicall...
research
04/13/2023

Language Instructed Reinforcement Learning for Human-AI Coordination

One of the fundamental quests of AI is to produce agents that coordinate...
research
11/01/2022

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions

The adoption of pre-trained language models to generate action plans for...
research
03/01/2019

Learning To Follow Directions in Street View

Navigating and understanding the real world remains a key challenge in m...
research
10/21/2019

Self-Educated Language Agent With Hindsight Experience Replay For Instruction Following

Language creates a compact representation of the world and allows the de...

Please sign up or login with your details

Forgot password? Click here to reset