Self-paced Principal Component Analysis

06/25/2021
by   Zhao Kang, et al.
39

Principal Component Analysis (PCA) has been widely used for dimensionality reduction and feature extraction. Robust PCA (RPCA), under different robust distance metrics, such as l1-norm and l2, p-norm, can deal with noise or outliers to some extent. However, real-world data may display structures that can not be fully captured by these simple functions. In addition, existing methods treat complex and simple samples equally. By contrast, a learning pattern typically adopted by human beings is to learn from simple to complex and less to more. Based on this principle, we propose a novel method called Self-paced PCA (SPCA) to further reduce the effect of noise and outliers. Notably, the complexity of each sample is calculated at the beginning of each iteration in order to integrate samples from simple to more complex into training. Based on an alternating optimization, SPCA finds an optimal projection matrix and filters out outliers iteratively. Theoretical analysis is presented to show the rationality of SPCA. Extensive experiments on popular data sets demonstrate that the proposed method can improve the state of-the-art results considerably.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 8

research
04/13/2019

Self-Paced Probabilistic Principal Component Analysis for Data with Outliers

Principal Component Analysis (PCA) is a popular tool for dimensionality ...
research
05/23/2020

Principal Component Analysis Based on Tℓ_1-norm Maximization

Classical principal component analysis (PCA) may suffer from the sensiti...
research
11/22/2020

Angular Embedding: A New Angular Robust Principal Component Analysis

As a widely used method in machine learning, principal component analysi...
research
02/10/2010

Intrinsic dimension estimation of data by principal component analysis

Estimating intrinsic dimensionality of data is a classic problem in patt...
research
06/17/2021

Pre-treatment of outliers and anomalies in plant data: Methodology and case study of a Vacuum Distillation Unit

Data pre-treatment plays a significant role in improving data quality, t...
research
08/31/2020

Directional Assessment of Traffic Flow Extremes

We analyze extremes of traffic flow profiles composed of traffic counts ...
research
09/14/2020

Principle Component Analysis for Classification of the Quality of Aromatic Rice

This research introduces an instrument for performing quality control on...

Please sign up or login with your details

Forgot password? Click here to reset