Reasoning and Generalization in RL: A Tool Use Perspective

07/03/2019
by   Sam Wenke, et al.
5

Learning to use tools to solve a variety of tasks is an innate ability of humans and has been observed of animals in the wild. However, the underlying mechanisms that are required to learn to use tools are abstract and widely contested in the literature. In this paper, we study tool use in the context of reinforcement learning and propose a framework for analyzing generalization inspired by a classic study of tool using behavior, the trap-tube task. Recently, it has become common in reinforcement learning to measure generalization performance on a single test set of environments. We instead propose transfers that produce multiple test sets that are used to measure specified types of generalization, inspired by abilities demonstrated by animal and human tool users. The source code to reproduce our experiments is publicly available at https://github.com/fomorians/gym_tool_use.

READ FULL TEXT

page 4

page 5

page 6

research
04/11/2022

JORLDY: a fully customizable open source framework for reinforcement learning

Recently, Reinforcement Learning (RL) has been actively researched in bo...
research
03/24/2023

marl-jax: Multi-agent Reinforcement Leaning framework for Social Generalization

Recent advances in Reinforcement Learning (RL) have led to many exciting...
research
12/23/2022

NARS vs. Reinforcement learning: ONA vs. Q-Learning

One of the realistic scenarios is taking a sequence of optimal actions t...
research
02/09/2021

rl_reach: Reproducible Reinforcement Learning Experiments for Robotic Reaching Tasks

Training reinforcement learning agents at solving a given task is highly...
research
09/13/2022

AnICA: Analyzing Inconsistencies in Microarchitectural Code Analyzers

Microarchitectural code analyzers, i.e., tools that estimate the through...
research
11/28/2022

Improved Representation of Asymmetrical Distances with Interval Quasimetric Embeddings

Asymmetrical distance structures (quasimetrics) are ubiquitous in our li...
research
07/31/2023

Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration

Tool use, a hallmark feature of human intelligence, remains a challengin...

Please sign up or login with your details

Forgot password? Click here to reset