Adaptive Regret for Control of Time-Varying Dynamics

07/08/2020
by   Paula Gradu, et al.
0

We consider regret minimization for online control with time-varying linear dynamical systems. The metric of performance we study is adaptive policy regret, or regret compared to the best policy on any interval in time. We give an efficient algorithm that attains first-order adaptive regret guarantees for the setting of online convex optimization with memory. We also show that these first-order bounds are nearly tight. This algorithm is then used to derive a controller with adaptive regret guarantees that provably competes with the best linear controller on any interval in time. We validate these theoretical findings experimentally on simulations of time-varying dynamics and disturbances.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset