Multi-Label Annotation of Chest Abdomen Pelvis Computed Tomography Text Reports Using Deep Learning

02/05/2021
by   Vincent M. D'Anniballe, et al.
0

To develop a high throughput multi-label annotator for body Computed Tomography (CT) reports that can be applied to a variety of diseases, organs, and cases. First, we used a dictionary approach to develop a rule-based algorithm (RBA) for extraction of disease labels from radiology text reports. We targeted three organ systems (lungs/pleura, liver/gallbladder, kidneys/ureters) with four diseases per system based on their prevalence in our dataset. To expand the algorithm beyond pre-defined keywords, an attention-guided recurrent neural network (RNN) was trained using the RBA-extracted labels to classify the reports as being positive for one or more diseases or normal for each organ system. Confounding effects on model performance were evaluated using random or pre-trained embedding as well as different sizes of training datasets. Performance was evaluated using the receiver operating characteristic (ROC) area under the curve (AUC) against 2,158 manually obtained labels. Our model extracted disease labels from 261,229 radiology reports of 112,501 unique subjects. Pre-trained models outperformed random embedding across all diseases. As the training dataset size was reduced, performance was robust except for a few diseases with relatively small number of cases. Pre-trained Classification AUCs achieved > 0.95 for all five disease outcomes across all three organ systems. Our label-extracting pipeline was able to encompass a variety of cases and diseases by generalizing beyond strict rules with exceptional accuracy. As a framework, this model can be easily adapted to enable automated labeling of hospital-scale medical data sets for training image-based disease classifiers.

READ FULL TEXT

page 4

page 5

research
08/03/2020

Weakly Supervised Multi-Organ Multi-Disease Classification of Body CT Scans

We designed a multi-organ, multi-label disease classification algorithm ...
research
01/12/2018

TieNet: Text-Image Embedding Network for Common Thorax Disease Classification and Reporting in Chest X-rays

Chest X-rays are one of the most common radiological examinations in dai...
research
06/09/2023

Automated Labeling of German Chest X-Ray Radiology Reports using Deep Learning

Radiologists are in short supply globally, and deep learning models offe...
research
06/11/2020

Automated Identification of Thoracic Pathology from Chest Radiographs with Enhanced Training Pipeline

Chest x-rays are the most common radiology studies for diagnosing lung a...
research
07/14/2021

Multi-Label Generalized Zero Shot Learning for the Classification of Disease in Chest Radiographs

Despite the success of deep neural networks in chest X-ray (CXR) diagnos...
research
09/11/2022

Learning to diagnose common thorax diseases on chest radiographs from radiology reports in Vietnamese

We propose a data collecting and annotation pipeline that extracts infor...

Please sign up or login with your details

Forgot password? Click here to reset