Noisy Label Learning for Large-scale Medical Image Classification

03/06/2021
by   Fengbei Liu, et al.
15

The classification accuracy of deep learning models depends not only on the size of their training sets, but also on the quality of their labels. In medical image classification, large-scale datasets are becoming abundant, but their labels will be noisy when they are automatically extracted from radiology reports using natural language processing tools. Given that deep learning models can easily overfit these noisy-label samples, it is important to study training approaches that can handle label noise. In this paper, we adapt a state-of-the-art (SOTA) noisy-label multi-class training approach to learn a multi-label classifier for the dataset Chest X-ray14, which is a large scale dataset known to contain label noise in the training set. Given that this dataset also has label noise in the testing set, we propose a new theoretically sound method to estimate the performance of the model on a hidden clean testing data, given the result on the noisy testing data. Using our clean data performance estimation, we notice that the majority of label noise on Chest X-ray14 is present in the class 'No Finding', which is intuitively correct because this is the most likely class to contain one or more of the 14 diseases due to labelling mistakes.

READ FULL TEXT
research
03/03/2022

Semantic-guided Image Virtual Attribute Learning for Noisy Multi-label Chest X-ray Classification

Deep learning methods have shown outstanding classification accuracy in ...
research
12/04/2019

Epoch-wise label attacks for robustness against label noise

The current accessibility to large medical datasets for training convolu...
research
11/25/2021

ACPL: Anti-curriculum Pseudo-labelling forSemi-supervised Medical Image Classification

Effective semi-supervised learning (SSL) in medical im-age analysis (MIA...
research
12/23/2020

Noisy Labels Can Induce Good Representations

The current success of deep learning depends on large-scale labeled data...
research
05/19/2021

Correlated Input-Dependent Label Noise in Large-Scale Image Classification

Large scale image classification datasets often contain noisy labels. We...
research
11/02/2018

Learning from Large-scale Noisy Web Data with Ubiquitous Reweighting for Image Classification

Many advances of deep learning techniques originate from the efforts of ...
research
09/11/2021

Co-Correcting: Noise-tolerant Medical Image Classification via mutual Label Correction

With the development of deep learning, medical image classification has ...

Please sign up or login with your details

Forgot password? Click here to reset