research
          
      
      ∙
      06/29/2021
    A Convergent and Efficient Deep Q Network Algorithm
Despite the empirical success of the deep Q network (DQN) reinforcement ...
          
            research
          
      
      ∙
      02/12/2020
    LaProp: a Better Way to Combine Momentum with Adaptive Gradient
Identifying a divergence problem in Adam, we propose a new optimizer, La...
          
            research
          
      
      ∙
      10/21/2019