A Concentration Bound for LSPE(λ)

11/04/2021
by   Vivek S. Borkar, et al.
4

The popular LSPE(λ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro