AutoRL Hyperparameter Landscapes

04/05/2023
by Aditya Mohan, et al.

Although Reinforcement Learning (RL) has been shown to be capable of producing impressive results, its use is limited by the strong impact of its hyperparameters on performance, which often makes good results difficult to achieve in practice. Automated RL (AutoRL) addresses this difficulty, yet little is known about the dynamics of the hyperparameter landscapes that hyperparameter optimization (HPO) methods traverse in search of optimal configurations. Since existing AutoRL approaches adjust hyperparameter configurations dynamically, we propose an approach to build and analyze these hyperparameter landscapes not just at one point in time but at multiple points throughout training. Addressing an important open question on the legitimacy of such dynamic AutoRL approaches, we provide thorough empirical evidence that the hyperparameter landscapes vary strongly over time across representative algorithms from the RL literature (DQN and SAC) in different kinds of environments (Cartpole and Hopper). This supports the theory that hyperparameters should be dynamically adjusted during training and shows the potential for further insights into AutoRL problems that can be gained through landscape analyses.

research
12/21/2022

Hyperparameters in Contextual RL are Highly Situational

Although Reinforcement Learning (RL) has shown impressive results in gam...
research
06/02/2023

Hyperparameters in Reinforcement Learning and How To Tune Them

In order to improve reproducibility, deep reinforcement learning (RL) ha...
research
06/18/2019

Towards White-box Benchmarks for Algorithm Control

The performance of many algorithms in the fields of hard combinatorial p...
research
05/18/2022

No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

The performance of reinforcement learning (RL) agents is sensitive to th...
research
02/26/2021

On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) is a promising framework for l...
research
07/19/2022

Bayesian Generational Population-Based Training

Reinforcement learning (RL) offers the potential for training generally ...
research
11/08/2021

Explaining Hyperparameter Optimization via Partial Dependence Plots

Automated hyperparameter optimization (HPO) can support practitioners to...
