In a single-agent setting, reinforcement learning (RL) tasks can be cast...
Real-time bidding, as one of the most popular mechanisms for selling onl...
We propose a new reasoning protocol called generalized recursive reasoni...
Humans are capable of attributing latent mental contents such as beliefs...
Deep Q-learning has achieved a significant success in single-agent decis...
We conduct an empirical study on discovering the ordered collective dyna...
In this paper, we conduct an empirical study on discovering the ordered
...
In typical reinforcement learning (RL), the environment is assumed given...
Many artificial intelligence (AI) applications often require multiple
in...
Recently, the rapid development of word embedding and neural networks ha...