We investigate the fixed-budget best-arm identification (BAI) problem fo...
We investigate learning the equilibria in non-stationary multi-agent sys...
This paper investigates when one can efficiently recover an approximate ...
Learning Nash equilibria is a central problem in multi-agent systems. In...
This work considers the problem of selective-sampling for best-arm
ident...
We study exploration using randomized value functions in Thompson Sampli...
It is well known that quantifying uncertainty in the action-value estima...