Average Cost Optimal Control of Stochastic Systems Using Reinforcement Learning

10/13/2020
by Jing Lai, et al.

This paper addresses the average cost minimization problem for discrete-time systems with multiplicative and additive noise via reinforcement learning. Using the Q-function, we propose an online learning scheme that estimates the kernel matrix of the Q-function and updates the control gain from data collected along the system trajectories. The resulting control gain and kernel matrix are proved to converge to their optimal values. To implement the scheme, we give an online model-free reinforcement learning algorithm in which the recursive least squares method estimates the kernel matrix of the Q-function. A numerical example illustrates the proposed approach.
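
As a rough illustration of the recursive least squares step mentioned above, the sketch below estimates the symmetric kernel matrix of a generic quadratic form z'Hz from noise-free samples. It is not the algorithm of the paper: the regression target used there (which involves the average cost and the stochastic system dynamics) is not given in this abstract, and the names `rls_update`, `quad_features`, `H_true`, and the initial covariance value are all assumptions made for the example.

```python
import numpy as np

def rls_update(theta, P, phi, y, forgetting=1.0):
    """One recursive least-squares step for the linear model y ~ phi @ theta."""
    phi = phi.reshape(-1, 1)
    # Gain vector weights the new sample against the current covariance P.
    gain = P @ phi / (forgetting + (phi.T @ P @ phi).item())
    # Prediction error of the current estimate on the new sample.
    err = y - (phi.T @ theta).item()
    theta = theta + gain * err
    P = (P - gain @ phi.T @ P) / forgetting
    return theta, P

def quad_features(z):
    """Monomials z_i * z_j for i <= j, so that z' H z is linear in the parameters."""
    n = len(z)
    return np.array([z[i] * z[j] for i in range(n) for j in range(i, n)])

# Hypothetical 2x2 kernel matrix to recover (illustration only).
H_true = np.array([[2.0, 0.5],
                   [0.5, 1.0]])

rng = np.random.default_rng(0)
theta = np.zeros((3, 1))      # parameters: [H11, 2*H12, H22]
P = 1e4 * np.eye(3)           # large initial covariance (assumed value)

for _ in range(200):
    z = rng.standard_normal(2)
    y = z @ H_true @ z        # noise-free sample of the quadratic form
    theta, P = rls_update(theta, P, quad_features(z), y)

# Rebuild the symmetric matrix; the off-diagonal parameter equals 2*H12.
t = theta.ravel()
H_est = np.array([[t[0], t[1] / 2],
                  [t[1] / 2, t[2]]])
```

In this toy setting `H_est` should approach `H_true` as samples accumulate. In a Q-learning scheme of the kind the abstract describes, z would stack the state and control input, and the estimated kernel matrix would then be used to update the control gain.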


research
08/20/2020

Model-free optimal control of discrete-time systems with additive and multiplicative noises

This paper investigates the optimal control problem for a class of discr...
research
10/01/2021

Design of multiplicative watermarking against covert attacks

This paper addresses the design of an active cyberattack detection archi...
research
07/16/2021

Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

This paper studies the adaptive optimal stationary control of continuous...
research
06/03/2011

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

The recursive least-squares (RLS) algorithm is one of the most well-know...
research
09/18/2018

Model-Free Adaptive Optimal Control of Sequential Manufacturing Processes using Reinforcement Learning

A self-learning optimal control algorithm for sequential manufacturing p...
research
07/21/2014

Practical Kernel-Based Reinforcement Learning

Kernel-based reinforcement learning (KBRL) stands out among reinforcemen...
