Transformer-based large-scale language models (LLMs) are able to generat...
When we exercise sequences of actions, their execution becomes more flue...
Everything else being equal, simpler models should be preferred over mor...
Conditional value-at-risk (CVaR) precisely characterizes the influence t...
Distributional reinforcement learning (RL) – in which agents learn about...
We consider the problem of learning to communicate using multi-agent
rei...
Monte-Carlo Tree Search (MCTS) is one of the most-widely used methods fo...
We investigate how reinforcement learning agents can learn to cooperate....
Existing Bayesian treatments of neural networks are typically characteri...
Neural networks trained with backpropagation often struggle to identify
...
Reciprocating interactions represent a central feature of all human
exch...
The computational costs of inference and planning have confined Bayesian...
Bayesian model-based reinforcement learning is a formally elegant approa...