Efficient Contextual Bandits with Knapsacks via Regression

11/14/2022
by   Aleksandrs Slivkins, et al.
0

We consider contextual bandits with knapsacks (CBwK), a variant of the contextual bandit which places global constraints on budget consumption. We present a new algorithm that is simple, statistically optimal, and computationally efficient. Our algorithm combines LagrangeBwK (Immorlica et al., FOCS'19), a Lagrangian-based technique for CBwK, and SquareCB (Foster and Rakhlin, ICML'20), a regression-based technique for contextual bandits. Our analysis emphasizes the modularity of both techniques.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset