research ∙ 05/18/2018
Two geometric input transformation methods for fast online reinforcement learning with neural nets
We apply neural nets with ReLU gates in online reinforcement learning. O...
research ∙ 12/27/2017
On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning
We consider off-policy temporal-difference (TD) learning methods for pol...
research ∙ 07/11/2012
Discretized Approximations for POMDP with Average Cost
In this paper, we propose a new lower approximation scheme for POMDP wit...
research ∙ 07/04/2012