While standard bandit algorithms sometimes incur high regret, their
perf...
We develop a reinforcement learning (RL) framework for applications that...
Discretization based approaches to solving online reinforcement learning...
We consider the problem of dividing limited resources to individuals arr...
We consider the problem of dividing limited resources between a set of a...
We introduce the technique of adaptive discretization to design efficien...
We present an efficient algorithm for model-free episodic reinforcement
...