Weakly supervised one-stage vision and language disease detection using large scale pneumonia and pneumothorax studies

07/31/2020
by   Leo K. Tam, et al.
20

Detecting clinically relevant objects in medical images is a challenge despite large datasets due to the lack of detailed labels. To address the label issue, we utilize the scene-level labels with a detection architecture that incorporates natural language information. We present a challenging new set of radiologist paired bounding box and natural language annotations on the publicly available MIMIC-CXR dataset especially focussed on pneumonia and pneumothorax. Along with the dataset, we present a joint vision language weakly supervised transformer layer-selected one-stage dual head detection architecture (LITERATI) alongside strong baseline comparisons with class activation mapping (CAM), gradient CAM, and relevant implementations on the NIH ChestXray-14 and MIMIC-CXR dataset. Borrowing from advances in vision language architectures, the LITERATI method demonstrates joint image and referring expression (objects localized in the image using natural language) input for detection that scales in a purely weakly supervised fashion. The architectural modifications address three obstacles – implementing a supervised vision and language detection method in a weakly supervised fashion, incorporating clinical referring expression natural language information, and generating high fidelity detections with map probabilities. Nevertheless, the challenging clinical nature of the radiologist annotations including subtle references, multi-instance specifications, and relatively verbose underlying medical reports, ensures the vision language detection task at scale remains stimulating for future investigation.

READ FULL TEXT
research
03/16/2023

VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

The use of large-scale vision-language datasets is limited for object de...
research
12/31/2021

Weakly Supervised Change Detection Using Guided Anisotropic Difusion

Large scale datasets created from crowdsourced labels or openly availabl...
research
06/27/2023

A Weakly Supervised Classifier and Dataset of White Supremacist Language

We present a dataset and classifier for detecting the language of white ...
research
06/09/2023

Read, look and detect: Bounding box annotation from image-caption pairs

Various methods have been proposed to detect objects while reducing the ...
research
10/05/2018

Weakly Supervised Object Detection in Artworks

We propose a method for the weakly supervised detection of objects in pa...
research
04/10/2020

Weakly supervised multiple instance learning histopathological tumor segmentation

Histopathological image segmentation is a challenging and important topi...
research
07/31/2021

Chest ImaGenome Dataset for Clinical Reasoning

Despite the progress in automatic detection of radiologic findings from ...

Please sign up or login with your details

Forgot password? Click here to reset