Multi-Objective Frequent Termset Clustering

by Katharina Morik, et al.

Large media collections evolve rapidly on the World Wide Web. In addition to the targeted retrieval performed by search engines, browsing and explorative navigation are important. Since the collections grow fast and authors most often do not annotate their web pages according to a given ontology, automatic structuring is in demand as a prerequisite for any pleasant human–computer interface. In this paper, we investigate the problem of finding alternative high-quality structures for navigation in a large collection of high-dimensional data. We express desired properties of frequent termset (FTS) clustering in terms of objective functions. In general, these functions conflict, which leads to the formulation of FTS clustering as a multi-objective optimization problem. The optimization problem is solved by a genetic algorithm, and the result is a set of Pareto-optimal solutions. Users may choose their favorite type of structure for navigating a collection or explore the different views given by the different optimal solutions. Using a social bookmarking data set, we explore the capability of the new approach to produce structures that are well suited for browsing.
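To illustrate what a set of Pareto-optimal solutions means when objectives conflict, the following minimal sketch filters candidate clusterings by Pareto dominance. It is not the paper's method: the abstract does not name the objective functions, so the two scores per candidate are hypothetical placeholders, both assumed to be maximized, and the genetic search that would generate the candidates is omitted.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a is at least as good as b on every objective
    and strictly better on at least one (all objectives maximized)."""
    return all(x >= y for x, y in zip(a, b)) and any(x > y for x, y in zip(a, b))


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Return the indices of candidates that no other candidate dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Hypothetical (objective_1, objective_2) scores for five candidate clusterings.
# These numbers are made up for illustration; they are not results from the paper.
candidates = [(0.9, 0.2), (0.6, 0.7), (0.5, 0.5), (0.3, 0.9), (0.9, 0.1)]
front = pareto_front(candidates)
print(front)  # indices of the non-dominated candidates
```

Each surviving candidate represents a different trade-off between the two objectives, which corresponds to the alternative views a user could choose among.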




An Analysis of the Admissibility of the Objective Functions Applied in Evolutionary Multi-objective Clustering

A variety of clustering criteria has been applied as an objective functi...

Creating Structure in Web Archives With Collections: Different Concepts From Web Archivists

As web archives' holdings grow, archivists subdivide them into collectio...

Multi-objective Semi-supervised Clustering for Finding Predictive Clusters

This study concentrates on clustering problems and aims to find compact ...

Simulation based Hardness Evaluation of a Multi-Objective Genetic Algorithm

Studies have shown that multi-objective optimization problems are hard p...

Finding Frequent Entities in Continuous Data

In many applications that involve processing high-dimensional data, it i...

Improved Multi-objective Data Stream Clustering with Time and Memory Optimization

The analysis of data streams has received considerable attention over th...
