A Concentration Bound for LSPE(λ)

11/04/2021

∙

by Vivek S. Borkar, et al.

∙

The popular LSPE(λ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

READ FULL TEXT

A Concentration Bound for LSPE(λ)

Sign in with Google

Consider DeepAI Pro