In a broad class of reinforcement learning applications, stochastic rewa...
We study the optimal scheduling problem where n source nodes attempt to
...
In a wide variety of applications including online advertising, contract...
Time-constrained decision processes have been ubiquitous in many fundame...
The theory of discrete-time online learning has been successfully applie...
We consider a budget-constrained bandit problem where each arm pull incu...
A significant challenge for future virtual reality (VR) applications is ...
The increase in demand for spectrum-based services forms a bottleneck in...
We study the problem of serving randomly arriving and delay-sensitive tr...
The behavior of users in relatively predictable, both in terms of the da...
In this paper, we introduce the COmbinatorial Multi-Objective Multi-Arme...
We study the stochastic multi-armed bandit (MAB) problem in the presence...