Feature Selection for Regression Problems Based on the Morisita Estimator of Intrinsic Dimension

01/31/2016
by   Jean Golay, et al.
0

Data acquisition, storage and management have been improved, while the key factors of many phenomena are not well known. Consequently, irrelevant and redundant features artificially increase the size of datasets, which complicates learning tasks, such as regression. To address this problem, feature selection methods have been proposed. This paper introduces a new supervised filter based on the Morisita estimator of intrinsic dimension. It can identify relevant features and distinguish between redundant and irrelevant information. Besides, it offers a clear graphical representation of the results, and it can be easily implemented in different programming languages. Comprehensive numerical experiments are conducted using simulated datasets characterized by different levels of complexity, sample size and noise. The suggested algorithm is also successfully tested on a selection of real world applications and compared with RReliefF using extreme learning machine. In addition, a new measure of feature relevance is presented and discussed.

READ FULL TEXT
research
08/19/2016

Unsupervised Feature Selection Based on the Morisita Estimator of Intrinsic Dimension

This paper deals with a new filter algorithm for selecting the smallest ...
research
10/12/2020

On Feature Selection Using Anisotropic General Regression Neural Network

The presence of irrelevant features in the input dataset tends to reduce...
research
03/17/2019

Deep Feature Selection using a Teacher-Student Network

High-dimensional data in many machine learning applications leads to com...
research
04/30/2019

A scalable saliency-based Feature selection method with instance level information

Classic feature selection techniques remove those features that are eith...
research
02/19/2019

Feature Selection for Better Spectral Characterization or: How I Learned to Start Worrying and Love Ensembles

An ever-looming threat to astronomical applications of machine learning ...
research
10/11/2018

MOANOFS: Multi-Objective Automated Negotiation based Online Feature Selection System for Big Data Classification

Feature Selection (FS) plays an important role in learning and classific...
research
06/15/2021

Employing an Adjusted Stability Measure for Multi-Criteria Model Fitting on Data Sets with Similar Features

Fitting models with high predictive accuracy that include all relevant b...

Please sign up or login with your details

Forgot password? Click here to reset