Improved Regret Bounds for Online Kernel Selection under Bandit Feedback

03/09/2023
by   Junfan Li, et al.
0

In this paper, we improve the regret bound for online kernel selection under bandit feedback. Previous algorithm enjoys a O((‖ f‖^2_ℋ_i+1)K^1/3T^2/3) expected bound for Lipschitz loss functions. We prove two types of regret bounds improving the previous bound. For smooth loss functions, we propose an algorithm with a O(U^2/3K^-1/3(∑^K_i=1L_T(f^∗_i))^2/3) expected bound where L_T(f^∗_i) is the cumulative losses of optimal hypothesis in ℍ_i={f∈ℋ_i:‖ f‖_ℋ_i≤ U}. The data-dependent bound keeps the previous worst-case bound and is smaller if most of candidate kernels match well with the data. For Lipschitz loss functions, we propose an algorithm with a O(U√(KT)ln^2/3T) expected bound asymptotically improving the previous bound. We apply the two algorithms to online kernel selection with time constraint and prove new regret bounds matching or improving the previous O(√(TlnK) +‖ f‖^2_ℋ_imax{√(T),T/√(ℛ)}) expected bound where ℛ is the time budget. Finally, we empirically verify our algorithms on online regression and classification tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2017

Efficient Online Bandit Multiclass Learning with Õ(√(T)) Regret

We present an efficient second-order algorithm with Õ(1/η√(T)) regret fo...
research
07/24/2020

Exploiting the Surrogate Gap in Online Multiclass Classification

We present Gaptron, a randomized first-order algorithm for online multic...
research
08/30/2013

Online Ranking: Discrete Choice, Spearman Correlation and Other Feedback

Given a set V of n objects, an online ranking system outputs at each tim...
research
02/27/2017

Algorithmic Chaining and the Role of Partial Feedback in Online Nonparametric Learning

We investigate contextual online learning with nonparametric (Lipschitz)...
research
12/26/2022

Improved Kernel Alignment Regret Bound for Online Kernel Learning

In this paper, we improve the kernel alignment regret bound for online k...
research
09/07/2020

Non-exponentially weighted aggregation: regret bounds for unbounded loss functions

We tackle the problem of online optimization with a general, possibly un...
research
05/01/2023

First- and Second-Order Bounds for Adversarial Linear Contextual Bandits

We consider the adversarial linear contextual bandit setting, which allo...

Please sign up or login with your details

Forgot password? Click here to reset