Title :
The effect of cluster location and dataset size on 2-stage k-means algorithm
Author :
Salman, Raied ; Kecman, Vojislav
Author_Institution :
Comput. Sci. Dept., Virginia Commonwealth Univ., Richmond, VA, USA
Abstract :
Paper introduces the 2-stage k-means algorithm which is faster than the standard 1-stage k-means algorithm. The main idea of the 2-stages is to move, in the first stage (fast), the centers of the clusters closer to their final locations. This will be done by using a small part of the data to achieve faster calculation. The next stage (slow) stage will start from the centers found during the first stage (fast). Different initial locations of the clusters have been used while testing the algorithms here. With bigger datasets, it is shown that the 2-stage clustering method achieves better speed-up.
Keywords :
pattern clustering; 2-stage clustering method; 2-stage k-means algorithm; cluster location; dataset size; Algorithm design and analysis; Arrays; Clustering algorithms; Clustering methods; Convergence; Data mining; Program processors; Clustering; Data Mining; Distance Calculation; k-means algorithm;
Conference_Titel :
Electronics, Control, Measurement and Signals (ECMS), 2011 10th International Workshop on
Conference_Location :
Liberec
Print_ISBN :
978-1-61284-397-1
Electronic_ISBN :
978-1-61284-396-4
DOI :
10.1109/IWECMS.2011.5952377