research
∙
06/29/2021
A Convergent and Efficient Deep Q Network Algorithm
Despite the empirical success of the deep Q network (DQN) reinforcement ...
research
∙
02/12/2020
LaProp: a Better Way to Combine Momentum with Adaptive Gradient
Identifying a divergence problem in Adam, we propose a new optimizer, La...
research
∙
10/21/2019