A Survey of Mixed Data Clustering Algorithms

11/11/2018
by   Amir Ahmad, et al.
0

Most of the datasets normally contain either numeric or categorical features. Mixed data comprises of both numeric and categorical features, and they frequently occur in various domains, such as health, finance, marketing, etc. Clustering is often sought on mixed data to find structures and to group similar objects. However, clustering mixed data is challenging because it is difficult to directly apply mathematical operations, such as summation, average etc. on the feature values of these datasets. In this paper, we review various types of mixed data clustering techniques in detail. We present a taxonomy to identify ten types of different mixed data clustering techniques. We also compare the performance of several mixed data clustering methods on publicly available datasets. The paper further identifies challenges in developing different mixed data clustering algorithms and provides guidelines for future directions in this area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

Hybrid Density- and Partition-based Clustering Algorithm for Data with Mixed-type Variables

Clustering is an essential technique for discovering patterns in data. T...
research
08/17/2016

Clustering Mixed Datasets Using Homogeneity Analysis with Applications to Big Data

Datasets with a mixture of numerical and categorical attributes are rout...
research
06/30/2020

Hierarchical Qualitative Clustering – clustering mixed datasets with critical qualitative information

Clustering can be used to extract insights from data or to verify some o...
research
09/21/2020

Learning Representation for Mixed Data Types with a Nonlinear Deep Encoder-Decoder Framework

Representation of data on mixed variables, numerical and categorical typ...
research
03/22/2021

Statistically-Robust Clustering Techniques for Mapping Spatial Hotspots: A Survey

Mapping of spatial hotspots, i.e., regions with significantly higher rat...
research
11/15/2012

Mixed LICORS: A Nonparametric Algorithm for Predictive State Reconstruction

We introduce 'mixed LICORS', an algorithm for learning nonlinear, high-d...
research
10/01/2022

Optimized Decoders for Mixed-Order Ambisonics

In this paper we discuss the motivation, design, and analysis of ambison...

Please sign up or login with your details

Forgot password? Click here to reset