This paper considers a stochastic multi-armed bandit (MAB) problem with ...
Diffusion on graphs is ubiquitous with numerous high-impact applications...
This paper considers a class of reinforcement learning problems, which
i...
We study model-free reinforcement learning (RL) algorithms in episodic
n...
This paper studies online nonstochastic control problems with adversaria...
Risk-sensitive reinforcement learning (RL) has become a popular tool to
...
Fair resource allocation is one of the most important topics in communic...
This paper studies a class of multi-agent reinforcement learning (MARL)
...
Multi-server queueing systems are widely used models for job scheduling ...
We consider the problem of controlling a mutated diffusion process with ...
Bilevel optimization has arisen as a powerful tool for solving a variety...
Multi-armed bandit (MAB) is a classic model for understanding the
explor...
This paper studies the sensitivity (or insensitivity) of a class of load...
Obstacle avoidance for small unmanned aircraft is vital for the safety o...
This paper presents the first model-free, simulator-free
reinforcement l...
This paper considers stochastic linear bandits with general constraints....
This paper provides a recipe for deriving calculable approximation error...
This paper considers constrained online dispatch with unknown arrival, r...
In this paper, we propose a new type of Actor, named forward-looking Act...
In this paper, we establish a theoretical comparison between the asympto...
We study two time-scale linear stochastic approximation algorithms, whic...
This paper combines data-driven and model-driven methods for real-time
m...
We consider the dynamics of a linear stochastic approximation algorithm
...
We consider a connection-level model proposed by Massoulié and Roberts
f...
In this paper, we consider a popular model for collaborative filtering i...
In standard clustering problems, data points are represented by vectors,...