In this paper, we present the Adaptive EntropyTree Search (ANTS) algorit...
Sample efficiency and performance in the offline setting have emerged as...
We propose a reinforcement learning framework for discrete environments ...
Model-free reinforcement learning (RL) can be used to learn effective
po...