Real-time outlier detection for large datasets by RT-DetMCD

10/12/2019
by   Bart De Ketelaere, et al.
0

Modern industrial machines can generate gigabytes of data in seconds, frequently pushing the boundaries of available computing power. Together with the time criticality of industrial processing this presents a challenging problem for any data analytics procedure. We focus on the deterministic minimum covariance determinant method (DetMCD), which detects outliers by fitting a robust covariance matrix. We construct a much faster version of DetMCD by replacing its initial estimators by two new methods and incorporating update-based concentration steps. The computation time is reduced further by parallel computing, with a novel robust aggregation method to combine the results from the threads. The speed and accuracy of the proposed real-time DetMCD method (RT-DetMCD) are illustrated by simulation and a real industrial application to food sorting.

READ FULL TEXT

page 22

page 23

research
08/07/2020

Outlier detection in non-elliptical data by kernel MRCD

The minimum regularized covariance determinant method (MRCD) is a robust...
research
12/28/2019

Flagging and handling cellwise outliers by robust estimation of a covariance matrix

We propose a method for detecting cellwise outliers. Given a robust cova...
research
10/24/2021

Robust Variable Selection under Cellwise Contamination

Cellwise outliers are widespread in data and traditional robust methods ...
research
03/04/2015

Large Dimensional Analysis of Robust M-Estimators of Covariance with Outliers

A large dimensional characterization of robust M-estimators of covarianc...
research
05/16/2019

MAIA: A Microservices-based Architecture for Industrial Data Analytics

In recent decades, it has become a significant tendency for industrial m...
research
08/25/2023

Stochastic Configuration Machines for Industrial Artificial Intelligence

Real-time predictive modelling with desired accuracy is highly expected ...
research
05/21/2021

Covariance-Free Sparse Bayesian Learning

Sparse Bayesian learning (SBL) is a powerful framework for tackling the ...

Please sign up or login with your details

Forgot password? Click here to reset