Toward Cohort Intelligence: A Universal Cohort Representation Learning Framework for Electronic Health Record Analysis

by Changshuo Liu, et al.

Electronic Health Records (EHR) are generated during routine clinical care and record valuable information about broad patient populations, providing ample opportunities to improve patient management and intervention strategies in clinical practice. To exploit this potential, a popular machine learning paradigm is EHR representation learning, which first uses a backbone model to learn informative representations from each individual patient's EHR data and then supports diverse downstream healthcare tasks built on those representations. Unfortunately, this paradigm does not analyze the relevance among patients in depth, an analysis generally known in clinical practice as a cohort study. Specifically, patients in the same cohort tend to share similar characteristics, implying that they resemble one another in medical conditions such as symptoms or diseases. In this paper, we propose a universal COhort Representation lEarning (CORE) framework that augments EHR utilization by leveraging fine-grained cohort information among patients. In particular, CORE first develops an explicit patient modeling task based on prior knowledge of patients' diagnosis codes, which measures the latent relevance among patients in order to adaptively divide cohorts for each patient. Based on the constructed cohorts, CORE recodes the pre-extracted EHR data representations from intra- and inter-cohort perspectives, yielding augmented EHR data representations. CORE is readily applicable to diverse backbone models, serving as a universal plug-in framework that infuses cohort information into healthcare methods for improved performance. We conduct an extensive experimental evaluation on two real-world datasets, and the results demonstrate the effectiveness and generalizability of CORE.
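
The general idea of a cohort plug-in on top of a backbone can be illustrated with a small sketch. This is not the CORE method itself: the abstract does not specify how relevance is measured or how representations are recoded, so the sketch assumes Jaccard similarity over binary diagnosis-code vectors, a fixed cohort size k, and simple mean pooling for the intra- and inter-cohort views. The function names (jaccard_similarity, build_cohorts, augment) and the toy data are hypothetical.

```python
import numpy as np


def jaccard_similarity(codes: np.ndarray) -> np.ndarray:
    """Pairwise Jaccard similarity between rows of a binary patient-by-code matrix.

    Assumption: Jaccard overlap stands in for the learned "latent relevance".
    """
    codes = codes.astype(bool)
    inter = (codes[:, None, :] & codes[None, :, :]).sum(axis=-1)
    union = (codes[:, None, :] | codes[None, :, :]).sum(axis=-1)
    # Patients with no codes at all get similarity 0 instead of a division by zero.
    return np.where(union > 0, inter / np.maximum(union, 1), 0.0)


def build_cohorts(codes: np.ndarray, k: int) -> np.ndarray:
    """For each patient, return the indices of the k most similar other patients."""
    sim = jaccard_similarity(codes)
    np.fill_diagonal(sim, -np.inf)  # a patient is not a member of their own cohort
    return np.argsort(-sim, axis=1, kind="stable")[:, :k]


def augment(reps: np.ndarray, cohorts: np.ndarray) -> np.ndarray:
    """Concatenate each backbone representation with two cohort summaries.

    Assumption: mean pooling replaces whatever recoding the framework uses.
    """
    intra = reps[cohorts].mean(axis=1)   # summary of the patient's own cohort
    inter = intra[cohorts].mean(axis=1)  # summary of the cohort members' cohorts
    return np.concatenate([reps, intra, inter], axis=1)


# Toy data: 4 patients, 4 diagnosis codes, 2-dimensional backbone representations.
codes = np.array([[1, 1, 0, 0],
                  [1, 1, 0, 0],
                  [0, 0, 1, 1],
                  [0, 0, 1, 1]])
reps = np.array([[1.0, 0.0],
                 [3.0, 0.0],
                 [0.0, 2.0],
                 [0.0, 4.0]])

cohorts = build_cohorts(codes, k=1)
augmented = augment(reps, cohorts)
```

Because the augmentation only consumes the backbone's output vectors, any model that produces per-patient representations could feed it, which is the sense in which such a module acts as a plug-in.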




Representation Learning for Electronic Health Records

Information in electronic health records (EHR), such as clinical narrati...

Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula

Clinical guidelines underscore the importance of regularly monitoring an...

Language Models Are An Effective Patient Representation Learning Technique For Electronic Health Record Data

Widespread adoption of electronic health records (EHRs) has fueled devel...

Rediscovery of CNN's Versatility for Text-based Encoding of Raw Electronic Health Records

Making the most use of abundant information in electronic health records...

A study linking patient EHR data to external death data at Stanford Medicine

This manuscript explores linking real-world patient data with external d...

Bootstrapping Your Own Positive Sample: Contrastive Learning With Electronic Health Record Data

Electronic Health Record (EHR) data has been of tremendous utility in Ar...

Unsupervised Representation for EHR Signals and Codes as Patient Status Vector

Effective modeling of electronic health records presents many challenges...
