Learning to evaluate and improve policies is a core problem of Reinforce...
Many reinforcement learning algorithms use value functions to guide the
...
We want to make progress toward artificial general intelligence, namely
...
We present new results on learning temporally extended actions for
conti...
Recent work has shown that temporally extended actions (options) can be
...
We explore deep reinforcement learning methods for multi-agent domains. ...
Eligibility traces in reinforcement learning are used as a bias-variance...
Temporal abstraction is key to scaling up learning and planning in
reinf...