Online learning methods yield sequential regret bounds under minimal
ass...
We investigate online classification with paid stochastic experts. Here,...
We study a K-armed bandit with delayed feedback and intermediate
observa...
We derive a new analysis of Follow The Regularized Leader (FTRL) for onl...
The framework of feedback graphs is a generalization of sequential
decis...
We consider prediction with expert advice for strongly convex and bounde...
We consider online learning with feedback graphs, a sequential
decision-...
We study nonstochastic bandits and experts in a delayed setting where de...
We study the problem of online multiclass classification in a setting wh...
In this paper we consider a distributed online learning
setting for jo...
We provide a new adaptive method for online convex optimization, MetaGra...
We present Gaptron, a randomized first-order algorithm for online multic...
We study bandit convex optimization methods that adapt to the norm of th...
A standard introduction to online learning might place Online Gradient
D...