Empirical design in reinforcement learning is no small task. Running goo...
Most value function learning algorithms in reinforcement learning are ba...
The policy gradient theorem (Sutton et al., 2000) prescribes the usage o...
Many reinforcement learning algorithms rely on value estimation. However...
We present ℛℒ_1-𝒢𝒫, a control framework that enables
safe simultaneous l...
It is still common to use Q-learning and temporal difference (TD)
learni...
Autonomous agents must be able to safely interact with other vehicles to...
Collision prediction in a dynamic and unknown environment relies on know...
In motion planning problems for autonomous robots, such as self-driving ...
This paper investigates the problem of online prediction learning, where...
In this paper we show that restricting the representation-layer of a
Rec...
Model-based strategies for control are critical to obtain sample efficie...