Transforming variables to central normality

05/16/2020
by   Jakob Raymaekers, et al.
23

Many real data sets contain features (variables) whose distribution is far from normal (gaussian). Instead, their distribution is often skewed. In order to handle such data it is customary to preprocess the variables to make them more normal. The Box-Cox and Yeo-Johnson transformations are well-known tools for this. However, the standard maximum likelihood estimator of their transformation parameter is highly sensitive to outliers, and will often try to move outliers inward at the expense of the normality of the central part of the data. We propose an automatic preprocessing technique that is robust against such outliers, which transforms the data to central normality. It compares favorably to existing techniques in an extensive simulation study and on real data.

READ FULL TEXT
research
01/16/2018

On a bimodal Birnbaum-Saunders distribution with applications to lifetime data

The Birnbaum-Saunders distribution is a flexible and useful model which ...
research
07/25/2018

Exponentiated Discrete Lindley Distribution: Properties and Applications

In this article, the exponentiated discrete Lindley distribution is pres...
research
08/04/2021

An autoregressive model for a censored data denoising method robust to outliers with application to the Obépine SARS-Cov-2 monitoring

A sentinel network, Obépine, has been designed to monitor SARS-CoV-2 vir...
research
03/14/2021

Optimal monomial quadratization for ODE systems

Quadratization problem is, given a system of ODEs with polynomial right-...
research
09/21/2022

A robust measure of skewness using cumulative statistic calculation

An important aspect of the shape of a distribution is the level of asymm...
research
03/23/2017

Robustness of Maximum Correntropy Estimation Against Large Outliers

The maximum correntropy criterion (MCC) has recently been successfully a...

Please sign up or login with your details

Forgot password? Click here to reset