We study worst-case guarantees on the expected return of fixed-dataset p...
Many reinforcement learning (RL) tasks provide the agent with
high-dimen...
Few-shot classification refers to learning a classifier for new classes ...
Reinforcement learning (RL) typically defines a discount factor as part ...
In this paper we revisit the method of off-policy corrections for
reinfo...
Deep reinforcement learning (deep RL) research has grown significantly i...