POND: Pessimistic-Optimistic oNline Dispatch

10/20/2020
by   Xin Liu, et al.
0

This paper considers constrained online dispatch with unknown arrival, reward and constraint distributions. We propose a novel online dispatch algorithm, named POND, standing for Pessimistic-Optimistic oNline Dispatch, which achieves O(√(T)) regret and O(1) constraint violation. Both bounds are sharp. Our experiments on synthetic and real datasets show that POND achieves low regret with minimal constraint violations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2021

A Lyapunov-Based Methodology for Constrained Optimization with Bandit Feedback

In a wide variety of applications including online advertising, contract...
research
04/08/2016

A Low Complexity Algorithm with O(√(T)) Regret and Finite Constraint Violations for Online Convex Optimization with Long Term Constraints

This paper considers online convex optimization over a complicated const...
research
02/25/2019

Improved Algorithm on Online Clustering of Bandits

We generalize the setting of online clustering of bandits by allowing no...
research
01/24/2023

Gossiped and Quantized Online Multi-Kernel Learning

In instances of online kernel learning where little prior information is...
research
11/18/2020

Online Paging with a Vanishing Regret

This paper considers a variant of the online paging problem, where the o...
research
01/25/2018

Nonparametric Hawkes Processes: Online Estimation and Generalization Bounds

In this paper, we design a nonparametric online algorithm for estimating...
research
09/15/2019

Online k-means Clustering

We study the problem of online clustering where a clustering algorithm h...

Please sign up or login with your details

Forgot password? Click here to reset