Assessing Policy, Loss and Planning Combinations in Reinforcement Learning using a New Modular Architecture

01/08/2022
by   Tiago Gaspar Oliveira, et al.
3

The model-based reinforcement learning paradigm, which uses planning algorithms and neural network models, has recently achieved unprecedented results in diverse applications, leading to what is now known as deep reinforcement learning. These agents are quite complex and involve multiple components, factors that can create challenges for research. In this work, we propose a new modular software architecture suited for these types of agents, and a set of building blocks that can be easily reused and assembled to construct new model-based reinforcement learning agents. These building blocks include planning algorithms, policies, and loss functions. We illustrate the use of this architecture by combining several of these building blocks to implement and test agents that are optimized to three different test environments: Cartpole, Minigrid, and Tictactoe. One particular planning algorithm, made available in our implementation and not previously used in reinforcement learning, which we called averaged minimax, achieved good results in the three tested environments. Experiments performed with this architecture have shown that the best combination of planning algorithm, policy, and loss function is heavily problem dependent. This result provides evidence that the proposed architecture, which is modular and reusable, is useful for reinforcement learning researchers who want to study new environments and techniques.

READ FULL TEXT
research
07/19/2017

Imagination-Augmented Agents for Deep Reinforcement Learning

We introduce Imagination-Augmented Agents (I2As), a novel architecture f...
research
06/03/2021

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

We present an end-to-end, model-based deep reinforcement learning agent ...
research
11/08/2020

On the role of planning in model-based deep reinforcement learning

Model-based planning is often thought to be necessary for deep, careful ...
research
04/05/2019

Synthesized Policies for Transfer and Adaptation across Tasks and Environments

The ability to transfer in reinforcement learning is key towards buildin...
research
03/19/2020

Adjust Planning Strategies to Accommodate Reinforcement Learning Agents

In agent control issues, the idea of combining reinforcement learning an...
research
01/08/2020

LiftTiles: Constructive Building Blocks for Prototyping Room-scale Shape-changing Interfaces

Large-scale shape-changing interfaces have great potential, but creating...
research
06/13/2022

ATDN vSLAM: An all-through Deep Learning-Based Solution for Visual Simultaneous Localization and Mapping

In this paper, a novel solution is introduced for visual Simultaneous Lo...

Please sign up or login with your details

Forgot password? Click here to reset