Dimension Reduction of High-Dimensional Datasets Based on Stepwise SVM

11/09/2017
by   Elizabeth P. Chou, et al.

This study proposes a dimension reduction method, stepwise support vector machine (SVM), for large p small n datasets, that is, datasets with far more variables than observations. The proposed method is compared with three other dimension reduction methods, namely the Pearson product-moment correlation coefficient (PCCs), recursive feature elimination based on random forest (RF-RFE), and principal component analysis (PCA), using five gene expression datasets. The prediction performance of the variables selected by the proposed method is also evaluated. The study found that stepwise SVM can effectively select the important variables and achieve good prediction performance. Moreover, the predictions of stepwise SVM on the reduced datasets were better than those on the unreduced datasets. The performance of stepwise SVM was more stable than that of PCA and RF-RFE, although it differed only minimally from that of PCCs. These results indicate that dimension reduction is necessary for large p small n datasets. We believe that stepwise SVM can effectively eliminate noise in data and improve the prediction accuracy in any large p small n dataset.
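The abstract does not spell out the stepwise procedure, so the following is only a minimal sketch of one common reading of the idea: forward stepwise feature selection wrapped around a linear SVM. It uses scikit-learn's SequentialFeatureSelector as a stand-in, not the authors' implementation. The synthetic data, the choice of 5 selected features, the linear kernel, and 3-fold cross-validation are all assumptions made for illustration.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.model_selection import cross_val_score
from sklearn.svm import SVC

# Synthetic "large p, small n" data (assumed sizes, not from the paper):
# 60 samples, 100 features, of which 5 are informative.
X, y = make_classification(
    n_samples=60, n_features=100, n_informative=5, n_redundant=0,
    random_state=0,
)

svm = SVC(kernel="linear", C=1.0)

# Forward stepwise search: start with no features and, at each step, add the
# single feature that gives the best cross-validated accuracy for the SVM.
selector = SequentialFeatureSelector(
    svm, n_features_to_select=5, direction="forward", cv=3
)
selector.fit(X, y)

selected = np.flatnonzero(selector.get_support())  # indices of kept features
X_reduced = selector.transform(X)                  # shape (n_samples, 5)

# Compare the same classifier on all features and on the selected subset.
acc_full = cross_val_score(svm, X, y, cv=3).mean()
acc_reduced = cross_val_score(svm, X_reduced, y, cv=3).mean()

print("selected feature indices:", selected)
print(f"CV accuracy, all features:      {acc_full:.3f}")
print(f"CV accuracy, selected features: {acc_reduced:.3f}")
```

Because the features here are selected on the same samples that are later used to score the reduced model, the reduced-data accuracy is optimistically biased. A fairer comparison would nest the selection step inside each cross-validation fold, for example by placing the selector and the SVM in a single scikit-learn Pipeline.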

