DocumentCode :
2275140
Title :
The effect of cluster location and dataset size on 2-stage k-means algorithm
Author :
Salman, Raied ; Kecman, Vojislav
Author_Institution :
Comput. Sci. Dept., Virginia Commonwealth Univ., Richmond, VA, USA
fYear :
2011
fDate :
1-3 June 2011
Firstpage :
1
Lastpage :
5
Abstract :
Paper introduces the 2-stage k-means algorithm which is faster than the standard 1-stage k-means algorithm. The main idea of the 2-stages is to move, in the first stage (fast), the centers of the clusters closer to their final locations. This will be done by using a small part of the data to achieve faster calculation. The next stage (slow) stage will start from the centers found during the first stage (fast). Different initial locations of the clusters have been used while testing the algorithms here. With bigger datasets, it is shown that the 2-stage clustering method achieves better speed-up.
Keywords :
pattern clustering; 2-stage clustering method; 2-stage k-means algorithm; cluster location; dataset size; Algorithm design and analysis; Arrays; Clustering algorithms; Clustering methods; Convergence; Data mining; Program processors; Clustering; Data Mining; Distance Calculation; k-means algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electronics, Control, Measurement and Signals (ECMS), 2011 10th International Workshop on
Conference_Location :
Liberec
Print_ISBN :
978-1-61284-397-1
Electronic_ISBN :
978-1-61284-396-4
Type :
conf
DOI :
10.1109/IWECMS.2011.5952377
Filename :
5952377
Link To Document :
بازگشت