Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes

06/01/2020
by   Sooraj Boominathan, et al.
5

In several medical decision-making problems, such as antibiotic prescription, laboratory testing can provide precise indications for how a patient will respond to different treatment options. This enables us to "fully observe" all potential treatment outcomes, but while present in historical data, these results are infeasible to produce in real-time at the point of the initial treatment decision. Moreover, treatment policies in these settings often need to trade off between multiple competing objectives, such as effectiveness of treatment and harmful side effects. We present, compare, and evaluate three approaches for learning individualized treatment policies in this setting: First, we consider two indirect approaches, which use predictive models of treatment response to construct policies optimal for different trade-offs between objectives. Second, we consider a direct approach that constructs such a set of policies without any intermediate models of outcomes. Using a medical dataset of Urinary Tract Infection (UTI) patients, we show that all approaches are able to find policies that achieve strictly better performance on all outcomes than clinicians, while also trading off between different objectives as desired. We demonstrate additional benefits of the direct approach, including flexibly incorporating other goals such as deferral to physicians on simple cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2018

Algorithmic Decision Making in the Presence of Unmeasured Confounding

On a variety of complex decision-making tasks, from doctors prescribing ...
research
06/21/2022

Policy learning with asymmetric utilities

Data-driven decision making plays an important role even in high stakes ...
research
05/23/2019

Learning When-to-Treat Policies

Many applied decision-making problems have a dynamic component: The poli...
research
05/27/2019

Contest Architecture with Public Disclosures

I study optimal disclosure policies in sequential contests. A contest de...
research
10/10/2019

Estimation of Utility-Maximizing Bounds on Potential Outcomes

Estimation of individual treatment effects is often used as the basis fo...
research
07/09/2021

Offline reinforcement learning with uncertainty for treatment strategies in sepsis

Guideline-based treatment for sepsis and septic shock is difficult becau...
research
10/08/2021

Medical Dead-ends and Learning to Identify High-risk States and Treatments

Machine learning has successfully framed many sequential decision making...

Please sign up or login with your details

Forgot password? Click here to reset