Human-AI Collaboration with Bandit Feedback

05/22/2021
by   Ruijiang Gao, et al.
0

Human-machine complementarity is important when neither the algorithm nor the human yield dominant performance across all instances in a given domain. Most research on algorithmic decision-making solely centers on the algorithm's performance, while recent work that explores human-machine collaboration has framed the decision-making problems as classification tasks. In this paper, we first propose and then develop a solution for a novel human-machine collaboration problem in a bandit feedback setting. Our solution aims to exploit the human-machine complementarity to maximize decision rewards. We then extend our approach to settings with multiple human decision makers. We demonstrate the effectiveness of our proposed methods using both synthetic and real human responses, and find that our methods outperform both the algorithm and the human when they each make decisions on their own. We also show how personalized routing in the presence of multiple human decision-makers can further improve the human-machine team performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2023

Learning Complementary Policies for Human-AI Teams

Human-AI complementarity is important when neither the algorithm nor the...
research
06/27/2022

Human-AI Collaboration in Decision-Making: Beyond Learning to Defer

Human-AI collaboration (HAIC) in decision-making aims to create synergis...
research
05/29/2023

An Emergency Disposal Decision-making Method with Human–Machine Collaboration

Rapid developments in artificial intelligence technology have led to unm...
research
06/23/2023

Co-creating a globally interpretable model with human input

We consider an aggregated human-AI collaboration aimed at generating a j...
research
06/07/2020

Implications of Human Irrationality for Reinforcement Learning

Recent work in the behavioural sciences has begun to overturn the long-h...
research
02/17/2022

Human-Algorithm Collaboration: Achieving Complementarity and Avoiding Unfairness

Much of machine learning research focuses on predictive accuracy: given ...
research
08/22/2023

When Are Two Lists Better than One?: Benefits and Harms in Joint Decision-making

Historically, much of machine learning research has focused on the perfo...

Please sign up or login with your details

Forgot password? Click here to reset