Rollout Algorithms and Approximate Dynamic Programming for Bayesian Optimization and Sequential Estimation

12/15/2022
by   Dimitri Bertsekas, et al.
0

We provide a unifying approximate dynamic programming framework that applies to a broad variety of problems involving sequential estimation. We consider first the construction of surrogate cost functions for the purposes of optimization, and we focus on the special case of Bayesian optimization, using the rollout algorithm and some of its variations. We then discuss the more general case of sequential estimation of a random vector using optimal measurement selection, and its application to problems of stochastic and adaptive control. We distinguish between adaptive control of deterministic and stochastic systems: the former are better suited for the use of rollout, while the latter are well suited for the use of rollout with certainty equivalence approximations. As an example of the deterministic case, we discuss sequential decoding problems, and a rollout algorithm for the approximate solution of the Wordle and Mastermind puzzles, recently developed in the paper [BBB22].

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2022

The Parametric Cost Function Approximation: A new approach for multistage stochastic programming

The most common approaches for solving multistage stochastic programming...
research
08/20/2021

Lessons from AlphaZero for Optimal, Model Predictive, and Adaptive Control

In this paper we aim to provide analysis and insights (often based on vi...
research
03/28/2023

Numerical Methods for Convex Multistage Stochastic Optimization

Optimization problems involving sequential decisions in a stochastic env...
research
06/25/2021

Optimal Checkpointing for Adjoint Multistage Time-Stepping Schemes

We consider checkpointing strategies that minimize the number of recompu...
research
11/04/2019

Lookahead Bayesian Optimization via Rollout: Guarantees and Sequential Rolling Horizons

Lookahead, also known as non-myopic, Bayesian optimization (BO) aims to ...
research
10/07/2017

Ranking and Selection as Stochastic Control

Under a Bayesian framework, we formulate the fully sequential sampling a...
research
12/14/2020

Cost-sensitive Hierarchical Clustering for Dynamic Classifier Selection

We consider the dynamic classifier selection (DCS) problem: Given an ens...

Please sign up or login with your details

Forgot password? Click here to reset