Exploration in environments which differ across episodes has received
in...
Natural agents can effectively learn from multiple data sources that dif...
In recent years, a number of reinforcement learning (RL) methods have be...
Direct policy gradient methods for reinforcement learning are a successf...
We present an algorithm, HOMER, for exploration and reinforcement learni...
We present a new model-based algorithm for reinforcement learning (RL) w...
Learning a policy using only observational data is challenging because t...
In this work we introduce a new framework for performing temporal predic...
Planning actions using learned and differentiable forward models of the ...
We introduce a new model, the Recurrent Entity Network (EntNet). It is
e...
Although RNNs have been shown to be powerful tools for processing sequen...
Deep Learning's recent successes have mostly relied on Convolutional
Net...
We study the connection between the highly non-convex loss function of a...
Convolutional networks are one of the most widely employed architectures...