Quantifying Inequality in Underreported Medical Conditions

10/08/2021
by Divya Shanmugam, et al.

Estimating the prevalence of a medical condition, or the proportion of the population in which it occurs, is a fundamental problem in healthcare and public health. Accurate estimates of the relative prevalence across groups – capturing, for example, that a condition affects women more frequently than men – facilitate effective and equitable health policy that prioritizes groups who are disproportionately affected by a condition. However, relative prevalence is difficult to estimate when a medical condition is underreported. In this work, we provide a method for accurately estimating the relative prevalence of underreported medical conditions, building upon the positive unlabeled learning framework. We show that under the commonly made covariate shift assumption – i.e., that the probability of having a disease conditional on symptoms remains constant across groups – we can recover the relative prevalence, even without the restrictive assumptions commonly made in positive unlabeled learning and even if it is impossible to recover the absolute prevalence. We provide a suite of experiments on synthetic and real health data that demonstrate our method's ability to recover the relative prevalence more accurately than baselines do, as well as its robustness to plausible violations of the covariate shift assumption.
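
The claim that relative prevalence can be recovered even when absolute prevalence cannot is easier to see with a short derivation. The following is a sketch only, not the paper's own statement of the method. The notation is an assumption introduced here for illustration: y is the true condition, s is the recorded diagnosis, x is the symptoms, g ∈ {a, b} is the group, and c_g is a group-specific diagnosis rate. It further assumes that there are no false-positive diagnoses and that, within a group, true cases are diagnosed at a rate that does not depend on x.

```latex
% Illustrative assumptions (notation not taken from the abstract):
%   (1) no false positives:        p(s=1 \mid y=0, x, g) = 0
%   (2) group-specific diagnosis:  p(s=1 \mid y=1, x, g) = c_g
%   (3) covariate shift:           p(y=1 \mid x, g) = p(y=1 \mid x)
\begin{align*}
p(s=1 \mid x, g) &= c_g \, p(y=1 \mid x)
  && \text{by (1)--(3)} \\
p(s=1 \mid g) &= c_g \, \mathbb{E}_{x \mid g}\!\left[ p(y=1 \mid x) \right]
              = c_g \, p(y=1 \mid g)
  && \text{averaging over } x \text{ within group } g \\
\frac{c_a}{c_b} &= \frac{p(s=1 \mid x, a)}{p(s=1 \mid x, b)}
  && \text{for any } x \text{ seen in both groups with } p(y=1 \mid x) > 0 \\
\frac{p(y=1 \mid a)}{p(y=1 \mid b)}
  &= \frac{p(s=1 \mid a)}{p(s=1 \mid b)} \cdot \frac{c_b}{c_a}
  && \text{relative prevalence}
\end{align*}
```

Under these assumptions, every quantity on the right-hand side of the last line is a ratio of probabilities of the observed diagnosis s, so the relative prevalence is determined by observable data. The individual rates c_a and c_b never appear on their own, only as a ratio, which is why the absolute prevalence p(y=1 | g) can remain unidentified.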


research
10/25/2021

Reconciling Risk Allocation and Prevalence Estimation in Public Health Using Batched Bandits

In many public health settings, there is a perceived tension between all...
research
03/24/2022

Minimizing Uncertainty in Prevalence Estimates

Estimating prevalence, the fraction of a population with a certain medic...
research
10/13/2020

Humane Visual AI: Telling the Stories Behind a Medical Condition

A biological understanding is key for managing medical conditions, yet p...
research
08/03/2022

Prevalence Estimation and Optimal Classification Methods to Account for Time Dependence in Antibody Levels

Serology testing can identify past infection by quantifying the immune r...
research
11/21/2019

Geo-clustered chronic affinity: pathways from socio-economic disadvantages to health disparities

Our objective was to develop and test a new concept (affinity) analogous...
research
08/30/2023

Minimal Assumptions for Optimal Serology Classification: Theory and Implications for Multidimensional Settings and Impure Training Data

Minimizing error in prevalence estimates and diagnostic classifiers rema...