Efficient Contextual Bandits with Knapsacks via Regression

11/14/2022
by   Aleksandrs Slivkins, et al.
0

We consider contextual bandits with knapsacks (CBwK), a variant of the contextual bandit which places global constraints on budget consumption. We present a new algorithm that is simple, statistically optimal, and computationally efficient. Our algorithm combines LagrangeBwK (Immorlica et al., FOCS'19), a Lagrangian-based technique for CBwK, and SquareCB (Foster and Rakhlin, ICML'20), a regression-based technique for contextual bandits. Our analysis emphasizes the modularity of both techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2015

An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives

We consider a contextual version of multi-armed bandit problem with glob...
research
05/22/2022

Contextual Information-Directed Sampling

Information-directed sampling (IDS) has recently demonstrated its potent...
research
06/10/2020

Efficient Contextual Bandits with Continuous Actions

We create a computationally tractable algorithm for contextual bandits w...
research
01/28/2019

Target Tracking for Contextual Bandits: Application to Demand Side Management

We propose a contextual-bandit approach for demand side management by of...
research
02/07/2021

Bandits for Learning to Explain from Explanations

We introduce Explearn, an online algorithm that learns to jointly output...
research
05/06/2021

Contextual Bandits with Sparse Data in Web setting

This paper is a scoping study to identify current methods used in handli...
research
05/31/2022

Provably and Practically Efficient Neural Contextual Bandits

We consider the neural contextual bandit problem. In contrast to the exi...

Please sign up or login with your details

Forgot password? Click here to reset