Learning Personalized Decision Support Policies

04/13/2023
by   Umang Bhatt, et al.
0

Individual human decision-makers may benefit from different forms of support to improve decision outcomes. However, a key question is which form of support will lead to accurate decisions at a low cost. In this work, we propose learning a decision support policy that, for a given input, chooses which form of support, if any, to provide. We consider decision-makers for whom we have no prior information and formalize learning their respective policies as a multi-objective optimization problem that trades off accuracy and cost. Using techniques from stochastic contextual bandits, we propose , an online algorithm to personalize a decision support policy for each decision-maker, and devise a hyper-parameter tuning strategy to identify a cost-performance trade-off using simulated human behavior. We provide computational experiments to demonstrate the benefits of compared to offline baselines. We then introduce , an interactive tool that provides with an interface. We conduct human subject experiments to show how learns policies personalized to each decision-maker and discuss the nuances of learning decision support policies online for real users.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset