svcR: An R Package for Support Vector Clustering improved with Geometric Hashing applied to Lexical Pattern Discovery

04/23/2015
by   Nicolas Turenne, et al.

We present a new R package that takes a numerical matrix as input and computes clusters using support vector clustering (SVC). We have implemented an original 2D-grid labeling approach to speed up cluster extraction. In this sense, SVC can be seen as an efficient cluster extraction method when clusters are separable in a 2D map. We also show that this SVC approach, using a Jaccard-Radial base kernel, can classify a set of terms into ontological classes reasonably well and can help define regular-expression rules for information extraction from documents. Our case study concerns a set of terms and documents about developmental and molecular biology.
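The abstract does not define the Jaccard-Radial base kernel, so the following Python sketch shows only one plausible reading: a radial-style kernel of the form exp(-q * d^2), where d is the Jaccard distance between two binary term vectors. The function names, the width parameter `q`, and the exact formula are assumptions made for illustration, not the package's API or the authors' definition.

```python
import math


def jaccard_similarity(a, b):
    """Jaccard similarity of two equal-length binary vectors."""
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-zero vectors are treated as identical (assumption).
    return 1.0 if either == 0 else both / either


def jaccard_radial_kernel(a, b, q=1.0):
    """Assumed form: exp(-q * d^2) with d = 1 - Jaccard similarity."""
    d = 1.0 - jaccard_similarity(a, b)
    return math.exp(-q * d * d)


def kernel_matrix(rows, q=1.0):
    """Pairwise kernel values for the rows of a binary matrix."""
    return [[jaccard_radial_kernel(r, s, q) for s in rows] for r in rows]


# Hypothetical term-by-document incidence rows, for illustration only.
terms = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 0, 1, 1],
]
K = kernel_matrix(terms, q=2.0)
```

Under this reading, identical rows get a kernel value of 1 and rows with no shared nonzero entries get exp(-q), so `q` controls how quickly similarity decays with Jaccard distance.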


