Classification of radiology reports by modality and anatomy: A comparative study

12/27/2018
by Marina Bendersky, et al.

Data labeling is currently a time-consuming task that often requires expert knowledge. In research settings, the availability of correctly labeled data is crucial to ensure that model predictions are accurate and useful. We propose relatively simple machine learning-based models that achieve high performance metrics in the binary and multiclass classification of radiology reports. We compare the performance of these algorithms to that of a data-driven approach based on NLP, and find that the logistic regression classifier outperforms all other models in both the binary and multiclass classification tasks. We then use the binary logistic regression classifier to predict chest X-ray (CXR) / non-chest X-ray (non-CXR) labels in reports from different datasets that were unseen during the training of any of the models. Even on these unseen report collections, the binary logistic regression classifier achieves average precision values above 0.9. Based on the regression coefficient values, we also identify frequent tokens in CXR and non-CXR reports that may be features with high predictive power.
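
To make the workflow in the abstract concrete, here is a minimal sketch of a binary CXR / non-CXR classifier: bag-of-words features, a logistic regression trained by stochastic gradient descent, average precision on held-out reports, and a ranking of tokens by coefficient. It is an illustration under stated assumptions, not the authors' implementation: the report snippets are invented, and the tokenizer, the hyperparameters (lr, epochs, l2), and all function names are hypothetical choices that do not come from the paper.

```python
import math
from collections import Counter

# Invented toy snippets (label 1 = CXR, 0 = non-CXR); NOT the paper's data.
TRAIN = [
    ("pa and lateral chest radiograph shows clear lungs", 1),
    ("chest x-ray no pleural effusion or pneumothorax", 1),
    ("portable chest film heart size is normal lungs clear", 1),
    ("ct abdomen with contrast shows normal liver", 0),
    ("mri brain without contrast no acute infarct", 0),
    ("ultrasound of the abdomen shows normal gallbladder", 0),
]
TEST = [
    ("chest radiograph shows clear lungs and normal heart size", 1),
    ("ct brain with contrast no acute findings", 0),
    ("portable chest x-ray no pneumothorax", 1),
    ("mri of the abdomen shows normal liver", 0),
]


def tokenize(text):
    # Assumption: lowercase whitespace tokenization is enough for a sketch.
    return text.lower().split()


def build_vocab(texts):
    tokens = sorted({tok for t in texts for tok in tokenize(t)})
    return {tok: i for i, tok in enumerate(tokens)}


def vectorize(text, vocab):
    # Sparse bag-of-words counts; out-of-vocabulary tokens are dropped.
    counts = Counter(tok for tok in tokenize(text) if tok in vocab)
    return {vocab[tok]: float(c) for tok, c in counts.items()}


def predict_proba(x, w, b):
    z = b + sum(w[i] * v for i, v in x.items())
    return 1.0 / (1.0 + math.exp(-z))


def train(X, y, n_features, lr=0.1, epochs=200, l2=0.01):
    # Plain SGD on the logistic loss. The L2 penalty is applied only to
    # features present in the current example (a simplification).
    w, b = [0.0] * n_features, 0.0
    for _ in range(epochs):
        for x, label in zip(X, y):
            err = predict_proba(x, w, b) - label
            for i, v in x.items():
                w[i] -= lr * (err * v + l2 * w[i])
            b -= lr * err
    return w, b


def average_precision(labels, scores):
    # Mean of precision@k taken at the rank k of each positive example.
    ranked = sorted(zip(scores, labels), key=lambda p: -p[0])
    hits, total = 0, 0.0
    for k, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            hits += 1
            total += hits / k
    return total / hits if hits else 0.0


def top_tokens(w, vocab, k=5):
    # Largest positive weights point toward CXR, most negative toward non-CXR.
    ranked = sorted(vocab, key=lambda tok: w[vocab[tok]])
    return ranked[-k:][::-1], ranked[:k]


vocab = build_vocab(text for text, _ in TRAIN)
X_train = [vectorize(text, vocab) for text, _ in TRAIN]
y_train = [label for _, label in TRAIN]
w, b = train(X_train, y_train, len(vocab))

test_scores = [predict_proba(vectorize(text, vocab), w, b) for text, _ in TEST]
ap = average_precision([label for _, label in TEST], test_scores)
cxr_tokens, non_cxr_tokens = top_tokens(w, vocab)
print("average precision:", ap)
print("tokens weighted toward CXR:", cxr_tokens)
print("tokens weighted toward non-CXR:", non_cxr_tokens)
```

Ranking tokens by the sign and size of their coefficients mirrors the abstract's idea of reading likely predictive tokens off the fitted model. With only six training snippets the ranking here is purely illustrative.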

