Title :
A method of two stage clustering using agglomerative hierarchical algorithms with one-pass k-means++ or k-median++
Author :
Tamura, Yoshinobu ; Miyamoto, Sadaaki
Author_Institution :
Master´s Program in Risk Eng., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
The aim of this paper is to propose a two-stage method of clustering in which the first stage uses one-pass k-median++ and the second stage uses an agglomerative hierarchical clustering. To handle medians in the second stage, we proposed two calculation methods. One method uses L1 distance as similarity. Another uses error of L1 distance like the Ward method. In this paper, we compared proposed method and a two-stage method of our study which uses k-means++ in the first stage to examine the effectiveness of L1 distance in two-stage methods. Numerical experiments have been done using two criteria: objective function values and the Rand index.
Keywords :
pattern clustering; L1 distance; Rand index; Ward method; agglomerative hierarchical clustering; k-median++; median handling; objective function values; one-pass k-means++; two stage clustering; Clustering algorithms; Educational institutions; Electronic mail; Indexes; Linear programming; Moon; Resource management; agglomerative hierarchical clustering; k-means++; k-median++; two-stage clustering;
Conference_Titel :
Granular Computing (GrC), 2014 IEEE International Conference on
Conference_Location :
Noboribetsu
DOI :
10.1109/GRC.2014.6982850