Synthetic Sampling for Multi-Class Malignancy Prediction

07/07/2018
by   Matthew Yung, et al.
0

We explore several oversampling techniques for an imbalanced multi-label classification problem, a setting often encountered when developing models for Computer-Aided Diagnosis (CADx) systems. While most CADx systems aim to optimize classifiers for overall accuracy without considering the relative distribution of each class, we look into using synthetic sampling to increase per-class performance when predicting the degree of malignancy. Using low-level image features and a random forest classifier, we show that using synthetic oversampling techniques increases the sensitivity of the minority classes by an average of 7.22 for a particular minority class. Furthermore, the analysis of low-level image feature distributions for the synthetic nodules reveals that these nodules can provide insights on how to preprocess image data for better classification performance or how to supplement the original datasets when more data acquisition is feasible.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2019

Synthetic Oversampling of Multi-Label Data based on Local Label Distribution

Class-imbalance is an inherent characteristic of multi-label data which ...
research
02/04/2023

Conformalized semi-supervised random forest for classification and abnormality detection

Traditional classifiers infer labels under the premise that the training...
research
01/05/2022

Detection of extragalactic Ultra-Compact Dwarfs and Globular Clusters using Explainable AI techniques

Compact stellar systems such as Ultra-compact dwarfs (UCDs) and Globular...
research
08/08/2018

Additional Representations for Improving Synthetic Aperture Sonar Classification Using Convolutional Neural Networks

Object classification in synthetic aperture sonar (SAS) imagery is usual...
research
10/25/2018

Superensemble Classifier for Improving Predictions in Imbalanced Datasets

Learning from an imbalanced dataset is a tricky proposition. Because the...
research
10/30/2017

Continuous Authentication Using One-class Classifiers and their Fusion

While developing continuous authentication systems (CAS), we generally a...

Please sign up or login with your details

Forgot password? Click here to reset