Multi-armed Bandit motivates methods with provable upper bounds on regre...
We study a decentralized multi-agent multi-armed bandit problem in which...
We study Pareto optimality in multi-objective multi-armed bandit by prov...
Uplift modeling is a rapidly growing approach that utilizes machine lear...
EXP-based algorithms are often used for exploration in multi-armed bandi...