EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

06/21/2022
by   Jiayi Weng, et al.

Reinforcement learning (RL) training systems have progressed significantly. Prior work such as IMPALA, Apex, Seed RL, and Sample Factory aims to improve overall system throughput. In this paper, we address a common bottleneck in RL training systems: parallel environment execution, which is often the slowest part of the whole system yet receives little attention. With a carefully designed approach to parallelizing RL environments, we improve environment simulation speed across hardware setups ranging from a laptop and a modest workstation to a high-end machine such as the NVIDIA DGX-A100. On a high-end machine, EnvPool achieves one million frames per second for environment execution on Atari environments and three million frames per second on MuJoCo environments. On a laptop, EnvPool runs at 2.8x the speed of Python subprocess-based execution. Moreover, the open-source community has demonstrated strong compatibility with existing RL training libraries, including CleanRL, rl_games, and DeepMind Acme. Finally, EnvPool allows researchers to iterate on their ideas at a much faster pace and has great potential to become the de facto RL environment execution engine. Example runs show that training agents to play Atari Pong and MuJoCo Ant takes only five minutes on a laptop. EnvPool is open-sourced at https://github.com/sail-sg/envpool.
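
To make the throughput figures concrete, here is a minimal sketch of what "parallel environment execution" and "frames per second" mean in practice. It is not EnvPool's implementation or API: `ToyEnv`, `step_batch`, and `measure_fps` are hypothetical names invented for illustration, the environment is a trivial counter, and environment steps per second are used as a proxy for frames per second. Python threads share the interpreter lock, so the sketch shows only the structure of batched stepping and the FPS bookkeeping, not a real speedup.

```python
import time
from concurrent.futures import ThreadPoolExecutor


class ToyEnv:
    """Hypothetical stand-in environment: a counter that ends after max_steps."""

    def __init__(self, max_steps=10):
        self.max_steps = max_steps
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= self.max_steps
        reward = float(action)
        obs = self.t
        if done:
            obs = self.reset()  # auto-reset so the batch never stalls
        return obs, reward, done


def step_batch(executor, envs, actions):
    """Step every environment once via the pool; results come back in env order."""
    return list(executor.map(lambda pair: pair[0].step(pair[1]), zip(envs, actions)))


def measure_fps(num_envs=8, num_iters=100):
    """Return (environment steps per second, results of the last batch)."""
    envs = [ToyEnv() for _ in range(num_envs)]
    for env in envs:
        env.reset()
    actions = [1] * num_envs  # a fixed dummy action for every environment
    with ThreadPoolExecutor(max_workers=num_envs) as executor:
        start = time.perf_counter()
        for _ in range(num_iters):
            results = step_batch(executor, envs, actions)
        elapsed = time.perf_counter() - start
    return num_envs * num_iters / elapsed, results


if __name__ == "__main__":
    fps, _ = measure_fps()
    print(f"toy throughput: {fps:.0f} steps/s")
```

The measured number depends entirely on the machine and on how expensive a single `step` call is, so it is not comparable to the Atari or MuJoCo figures quoted above; the point is only that throughput is the total number of environment steps across the batch divided by wall-clock time.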

research
10/08/2019

TorchBeast: A PyTorch Platform for Distributed RL

TorchBeast is a platform for reinforcement learning (RL) research in PyT...
research
08/31/2021

WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU

Deep reinforcement learning (RL) is a powerful framework to train decisi...
research
10/03/2022

CaiRL: A High-Performance Reinforcement Learning Environment Toolkit

This paper addresses the dire need for a platform that efficiently provi...
research
06/21/2020

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

Increasing the scale of reinforcement learning experiments has allowed r...
research
06/13/2023

Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second

We present Galactic, a large-scale simulation and reinforcement-learning...
research
10/15/2019

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference

We present a modern scalable reinforcement learning agent called SEED (S...
research
06/24/2023

Minigrid Miniworld: Modular Customizable Reinforcement Learning Environments for Goal-Oriented Tasks

We present the Minigrid and Miniworld libraries which provide a suite of...
