MPC-based Reinforcement Learning for Economic Problems with Application to Battery Storage

04/06/2021
by   Arash Bahari Kordabad, et al.

In this paper, we are interested in optimal control problems with purely economic costs, which often yield optimal policies with a (nearly) bang-bang structure. We focus on policy approximations based on Model Predictive Control (MPC) and on the use of the deterministic policy gradient method to optimize the closed-loop performance of the MPC scheme in the presence of unmodelled stochasticity or model error. When the policy has a (nearly) bang-bang structure, we observe that the policy gradient method can struggle to produce meaningful steps in the policy parameters. To tackle this issue, we propose a homotopy strategy based on the interior-point method, which relaxes the policy during learning. We investigate a specific, well-known battery storage problem and show that the proposed method delivers homogeneous and faster learning compared with a classical policy gradient approach.
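To make the gradient issue concrete, the following is a minimal sketch on a toy problem, not the paper's battery MPC scheme. It assumes a scalar action `u` in `[0, 1]` and a linear economic stage cost `g * u`, where `g` stands in for a price-like signal that depends on the policy parameters. The exact minimizer is bang-bang, so its sensitivity to `g` is zero almost everywhere. Adding a log-barrier `-tau * (log(u) + log(1 - u))`, as an interior-point method does, gives a smooth relaxed policy whose sensitivity is nonzero and fades as `tau` shrinks. All function names and the toy cost are assumptions made for illustration.

```python
import math


def bang_bang_policy(g):
    """Exact minimizer of g * u over 0 <= u <= 1 (toy problem, assumed)."""
    return 0.0 if g > 0.0 else 1.0


def relaxed_policy(g, tau):
    """Minimizer of g * u - tau * (log(u) + log(1 - u)) for tau > 0.

    Stationarity gives g - tau / u + tau / (1 - u) = 0, a quadratic in u.
    The root inside (0, 1) is written in a form that stays valid at g = 0.
    """
    s = math.sqrt(g * g + 4.0 * tau * tau)
    return 2.0 * tau / (g + 2.0 * tau + s)


def relaxed_policy_sensitivity(g, tau):
    """Analytic derivative d u / d g of the relaxed policy."""
    s = math.sqrt(g * g + 4.0 * tau * tau)
    return -2.0 * tau * (1.0 + g / s) / (g + 2.0 * tau + s) ** 2


if __name__ == "__main__":
    g = 1.0  # hypothetical signal value away from the switching point g = 0
    print("bang-bang action:", bang_bang_policy(g), "(sensitivity is 0 here)")
    # Homotopy: start with a large barrier parameter and shrink it.
    for tau in (0.5, 0.1, 1e-3):
        u = relaxed_policy(g, tau)
        du_dg = relaxed_policy_sensitivity(g, tau)
        print(f"tau={tau:g}  u={u:.6f}  du/dg={du_dg:.6f}")
```

In this toy setting, a larger `tau` yields an informative sensitivity even far from the switching point, while `tau` close to zero recovers the bang-bang action with an almost vanishing sensitivity. A schedule that decreases `tau` over the course of learning is one way to read the homotopy idea described in the abstract.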


research
11/18/2022

Policy Learning for Nonlinear Model Predictive Control with Application to USVs

The unaffordable computation load of nonlinear model predictive control ...
research
12/27/2021

Safe Reinforcement Learning with Chance-constrained Model Predictive Control

Real-world reinforcement learning (RL) problems often demand that agents...
research
03/25/2022

Quasi-Newton Iteration in Deterministic Policy Gradient

This paper presents a model-free approximation for the Hessian of the pe...
research
03/02/2021

Data-driven MIMO control of room temperature and bidirectional EV charging using deep reinforcement learning: simulation and experiments

The control of modern buildings is, on one hand, a complex multi-variabl...
research
11/22/2022

Evaluation of MPC-based Imitation Learning for Human-like Autonomous Driving

This work evaluates and analyzes the combination of imitation learning (...
research
04/20/2021

Model-predictive control and reinforcement learning in multi-energy system case studies

Model-predictive-control (MPC) offers an optimal control technique to es...
research
02/21/2023

UAV Path Planning Employing MPC- Reinforcement Learning Method for search and rescue mission

In this paper, we tackle the problem of Unmanned Aerial (UAV) path plan...
