MD-Manifold: A Medical-Distance-Based Representation Learning Approach for Medical Concept and Patient Representation

by   Shaodong Wang, et al.

Effectively representing medical concepts and patients is important for healthcare analytical applications. Representing medical concepts for healthcare analytical tasks requires incorporating medical domain knowledge and prior information from patient description data. Current methods, such as feature engineering and mapping medical concepts to standardized terminologies, have limitations in capturing the dynamic patterns from patient description data. Other embedding-based methods have difficulties in incorporating important medical domain knowledge and often require a large amount of training data, which may not be feasible for most healthcare systems. Our proposed framework, MD-Manifold, introduces a novel approach to medical concept and patient representation. It includes a new data augmentation approach, concept distance metric, and patient-patient network to incorporate crucial medical domain knowledge and prior data information. It then adapts manifold learning methods to generate medical concept-level representations that accurately reflect medical knowledge and patient-level representations that clearly identify heterogeneous patient cohorts. MD-Manifold also outperforms other state-of-the-art techniques in various downstream healthcare analytical tasks. Our work has significant implications in information systems research in representation learning, knowledge-driven machine learning, and using design science as middle-ground frameworks for downstream explorative and predictive analyses. Practically, MD-Manifold has the potential to create effective and generalizable representations of medical concepts and patients by incorporating medical domain knowledge and prior data information. It enables deeper insights into medical data and facilitates the development of new analytical applications for better healthcare outcomes.


page 33

page 36


Distributed representation of patients and its use for medical cost prediction

Efficient representation of patients is very important in the healthcare...

Unifying Heterogenous Electronic Health Records Systems via Text-Based Code Embedding

Substantial increase in the use of Electronic Health Records (EHRs) has ...

UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge

Despite the abundance of Electronic Healthcare Records (EHR), its hetero...

Patient Embeddings in Healthcare and Insurance Applications

The paper researches the problem of concept and patient representations ...

Statistical Latent Space Approach for Mixed Data Modelling and Applications

The analysis of mixed data has been raising challenges in statistics and...

MIMO: Mutual Integration of Patient Journey and Medical Ontology for Healthcare Representation Learning

Healthcare representation learning on the Electronic Health Record (EHR)...

Graph representation forecasting of patient's medical conditions: towards a digital twin

Objective: Modern medicine needs to shift from a wait and react, curativ...

Please sign up or login with your details

Forgot password? Click here to reset