A Transformer-based representation-learning model with unified processing of multimodal input for clinical diagnostics

06/01/2023
by   Hong-Yu Zhou, et al.
0

During the diagnostic process, clinicians leverage multimodal information, such as chief complaints, medical images, and laboratory-test results. Deep-learning models for aiding diagnosis have yet to meet this requirement. Here we report a Transformer-based representation-learning model as a clinical diagnostic aid that processes multimodal input in a unified manner. Rather than learning modality-specific features, the model uses embedding layers to convert images and unstructured and structured text into visual tokens and text tokens, and bidirectional blocks with intramodal and intermodal attention to learn a holistic representation of radiographs, the unstructured chief complaint and clinical history, structured clinical information such as laboratory-test results and patient demographic information. The unified model outperformed an image-only model and non-unified multimodal diagnosis models in the identification of pulmonary diseases (by 12 prediction of adverse clinical outcomes in patients with COVID-19 (by 29 7 help streamline triage of patients and facilitate the clinical decision process.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 11

page 17

research
11/29/2018

Improving Hospital Mortality Prediction with Medical Named Entities and Multimodal Learning

Clinical text provides essential information to estimate the acuity of a...
research
08/24/2021

Identification of Pediatric Respiratory Diseases Using Fine-grained Diagnosis System

Respiratory diseases, including asthma, bronchitis, pneumonia, and upper...
research
06/17/2022

Multimodal Attention-based Deep Learning for Alzheimer's Disease Diagnosis

Alzheimer's Disease (AD) is the most common neurodegenerative disorder w...
research
09/30/2021

Décomposition et analyse de tracés EMG pour aider au diagnostic des maladies neuromusculaires

The electromyogram (EMG) in needle detection represents one of the steps...
research
05/26/2023

Gender, Smoking History and Age Prediction from Laryngeal Images

Flexible laryngoscopy is commonly performed by otolaryngologists to dete...
research
09/26/2020

Bidirectional Representation Learning from Transformers using Multimodal Electronic Health Record Data for Chronic to Predict Depression

Advancements in machine learning algorithms have had a beneficial impact...
research
03/28/2023

Multimodal and multicontrast image fusion via deep generative models

Recently, it has become progressively more evident that classic diagnost...

Please sign up or login with your details

Forgot password? Click here to reset