Optimistic Agents are Asymptotically Optimal

09/29/2012
by   Peter Sunehag, et al.
0

We use optimism to introduce generic asymptotically optimal reinforcement learning agents. They achieve, with an arbitrary finite or compact class of environments, asymptotically optimal behavior. Furthermore, in the finite deterministic case we provide finite error bounds.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset