ORL: Reinforcement Learning Benchmarks for Online Stochastic Optimization Problems

11/24/2019
by   Bharathan Balaji, et al.

Reinforcement Learning (RL) has achieved state-of-the-art results in domains such as robotics and games. We build on this previous work by applying RL algorithms to a selection of canonical online stochastic optimization problems with a range of practical applications: Bin Packing, Newsvendor, and Vehicle Routing. While there is a nascent literature that applies RL to these problems, there are no commonly accepted benchmarks that can be used to compare proposed approaches rigorously in terms of performance, scale, or generalizability. This paper aims to fill that gap. For each problem, we apply both standard approaches and newer RL algorithms and analyze the results. In each case, the performance of the trained RL policy is competitive with or superior to the corresponding baselines, while requiring little domain knowledge. This highlights the potential of RL in real-world dynamic resource allocation problems.


research
08/14/2020

OR-Gym: A Reinforcement Learning Library for Operations Research Problem

Reinforcement learning (RL) has been widely applied to game-playing and ...
research
06/16/2019

Reinforcement Learning Driven Heuristic Optimization

Heuristic algorithms such as simulated annealing, Concorde, and METIS ar...
research
05/09/2023

Assessment of Reinforcement Learning Algorithms for Nuclear Power Plant Fuel Optimization

The nuclear fuel loading pattern optimization problem has been studied s...
research
09/13/2019

Towards an Adaptive Robot for Sports and Rehabilitation Coaching

The work presented in this paper aims to explore how, and to what extent...
research
05/21/2014

A Comparison of Monte Carlo Tree Search and Mathematical Optimization for Large Scale Dynamic Resource Allocation

Dynamic resource allocation (DRA) problems are an important class of dyn...
research
02/24/2018

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

Evolution Strategies (ES) have recently been demonstrated to be a viable...
research
12/30/2021

Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning

Online reinforcement learning (RL) algorithms are often difficult to dep...
