Co-occurrence of medical conditions: Exposing patterns through probabilistic topic modeling of SNOMED codes

by Moumita Bhattacharya, et al.

Patients with multiple co-occurring health conditions often face aggravated complications and less favorable outcomes. Co-occurring conditions are especially prevalent among individuals suffering from kidney disease, an increasingly widespread condition affecting 13% of the US population. This study aims to identify and characterize patterns of co-occurring medical conditions in patients using a probabilistic framework. Specifically, we apply topic modeling in a non-traditional way to find associations across SNOMED CT codes assigned and recorded in the EHRs of more than 13,000 patients diagnosed with kidney disease. Unlike most prior work on topic modeling, we apply the method to codes rather than to natural language. Moreover, we quantitatively evaluate the topics, assessing their tightness and distinctiveness, and also assess the medical validity of our results. Our experiments show that each topic is succinctly characterized by a few highly probable and unique disease codes, indicating that the topics are tight. Furthermore, the inter-topic distance between each pair of topics is typically high, illustrating distinctiveness. Last, most coded conditions grouped together within a topic are indeed reported to co-occur in the medical literature. Notably, our results uncover a few indirect associations among conditions that have hitherto not been reported as correlated in the medical literature.
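To make the setup concrete, the following is a minimal sketch of topic modeling over codes rather than words: each patient is treated as a "document" whose "words" are diagnosis codes. The abstract does not name the specific model or distance measure, so this sketch assumes latent Dirichlet allocation fitted by collapsed Gibbs sampling and uses Jensen-Shannon divergence as the inter-topic distance. The function names (`fit_lda`, `js_divergence`, `top_mass`), the hyperparameters, and the toy codes are all hypothetical placeholders, not real SNOMED CT identifiers or the authors' implementation.

```python
import math
import random


def fit_lda(docs, n_topics, n_iter=50, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (assumed model, not confirmed by the source).

    docs: list of patients, each a list of code strings.
    Returns one dict per topic mapping code -> probability.
    """
    rng = random.Random(seed)
    vocab = sorted({code for doc in docs for code in doc})
    index = {code: i for i, code in enumerate(vocab)}
    n_codes = len(vocab)

    doc_topic = [[0] * n_topics for _ in docs]            # per-patient topic counts
    topic_code = [[0] * n_codes for _ in range(n_topics)]  # per-topic code counts
    topic_total = [0] * n_topics
    assign = []

    # Random initial topic assignment for every code occurrence.
    for d, doc in enumerate(docs):
        zs = []
        for code in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_code[z][index[code]] += 1
            topic_total[z] += 1
        assign.append(zs)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, code in enumerate(doc):
                w = index[code]
                z = assign[d][i]
                # Remove the current assignment before resampling it.
                doc_topic[d][z] -= 1
                topic_code[z][w] -= 1
                topic_total[z] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_code[k][w] + beta)
                    / (topic_total[k] + n_codes * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_code[z][w] += 1
                topic_total[z] += 1

    # Smoothed topic-code distributions; each sums to 1.
    return [
        {
            vocab[w]: (topic_code[k][w] + beta) / (topic_total[k] + n_codes * beta)
            for w in range(n_codes)
        }
        for k in range(n_topics)
    ]


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2, so in [0, 1]) over a shared code set."""
    def kl(a, m):
        return sum(a[c] * math.log2(a[c] / m[c]) for c in a if a[c] > 0)

    m = {c: 0.5 * (p[c] + q[c]) for c in p}
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def top_mass(topic, n):
    """Probability mass held by the n most probable codes (a tightness proxy)."""
    return sum(sorted(topic.values(), reverse=True)[:n])


# Toy patients with made-up placeholder codes (not real SNOMED CT codes).
toy_patients = [
    ["CODE_A", "CODE_B", "CODE_C"],
    ["CODE_A", "CODE_B"],
    ["CODE_B", "CODE_C", "CODE_A"],
    ["CODE_X", "CODE_Y", "CODE_Z"],
    ["CODE_X", "CODE_Z"],
    ["CODE_Y", "CODE_Z", "CODE_X"],
]

if __name__ == "__main__":
    topics = fit_lda(toy_patients, n_topics=2)
    for k, topic in enumerate(topics):
        print(k, "mass of top 3 codes:", round(top_mass(topic, 3), 3))
    print("inter-topic JS divergence:", round(js_divergence(topics[0], topics[1]), 3))
```

Under these assumptions, a topic is "tight" when a handful of codes carries most of its probability mass, and two topics are "distinct" when their divergence is close to 1. The exact measures used in the paper may differ.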



Identifying Patterns of Associated-Conditions through Topic Models of Electronic Medical Records

Multiple adverse health conditions co-occurring in a patient are typical...

Topic Modeling on Clinical Social Work Notes for Exploring Social Determinants of Health Factors

Most research studying social determinants of health (SDoH) has focused ...

Supervised multi-specialist topic model with applications on large-scale electronic health record data

Motivation: Electronic health record (EHR) data provides a new venue to ...

Mining Themes in Clinical Notes to Identify Phenotypes and to Predict Length of Stay in Patients admitted with Heart Failure

Heart failure is a syndrome which occurs when the heart is not able to p...

Temporal Topic Modeling to Assess Associations between News Trends and Infectious Disease Outbreaks

In retrospective assessments, internet news reports have been shown to c...

Viewpoint and Topic Modeling of Current Events

There are multiple sides to every story, and while statistical topic mod...

POPDx: An Automated Framework for Patient Phenotyping across 392,246 Individuals in the UK Biobank Study

Objective For the UK Biobank standardized phenotype codes are associated...
