Multi-feature Clustering of Step Data using Multivariate Functional Principal Component Analysis

10/15/2020
by   Wookyeong Song, et al.
0

This paper presents a new statistical method for clustering step data, a popular form of health record data easily obtained from wearable devices. Since step data are high-dimensional and zero-inflated, classical methods such as K-means and partitioning around medoid (PAM) cannot be applied directly. The proposed method is a novel combination of newly constructed variables that reflect the inherent features of step data, such as quantity, strength, and pattern, and a multivariate functional principal component analysis that can integrate all the features of the step data for clustering. The proposed method is implemented by applying a conventional clustering method such as K-means and PAM to the multivariate functional principal component scores obtained from these variables. Simulation studies and real data analysis demonstrate significant improvement in clustering quality.

READ FULL TEXT

page 15

page 18

page 22

research
12/08/2021

Tutorial on principal component analysis, with applications in R

This tutorial reviews the main steps of the principal component analysis...
research
11/22/2022

Factor-guided functional PCA for high-dimensional functional data

The literature on high-dimensional functional data focuses on either the...
research
08/01/2014

Functional Principal Component Analysis and Randomized Sparse Clustering Algorithm for Medical Image Analysis

Due to advances in sensors, growing large and complex medical image data...
research
10/18/2017

Fast PET Scan Tumor Segmentation using Superpixels, Principal Component Analysis and K-means Clustering

Positron Emission Tomography scan images are extensively used in radioth...
research
08/12/2020

Research on the construction method of vehicle driving cycle based on Mean Shift clustering

In this study, a novel method for the construction of a driving cycle ba...
research
10/07/2019

Nonparametric principal subspace regression

In scientific applications, multivariate observations often come in tand...
research
10/04/2021

Row-clustering of a Point Process-valued Matrix

Structured point process data harvested from various platforms poses new...

Please sign up or login with your details

Forgot password? Click here to reset