مرکز منطقه ای اطلاع رساني علوم و فناوري - The effect of cluster location and dataset size on 2-stage k-means algorithm

DocumentCode :

2275140

Title :

The effect of cluster location and dataset size on 2-stage k-means algorithm

Author :

Salman, Raied ; Kecman, Vojislav

Author_Institution :

Comput. Sci. Dept., Virginia Commonwealth Univ., Richmond, VA, USA

fYear :

2011

fDate :

1-3 June 2011

Firstpage :

Lastpage :

Abstract :

Paper introduces the 2-stage k-means algorithm which is faster than the standard 1-stage k-means algorithm. The main idea of the 2-stages is to move, in the first stage (fast), the centers of the clusters closer to their final locations. This will be done by using a small part of the data to achieve faster calculation. The next stage (slow) stage will start from the centers found during the first stage (fast). Different initial locations of the clusters have been used while testing the algorithms here. With bigger datasets, it is shown that the 2-stage clustering method achieves better speed-up.

Keywords :

pattern clustering; 2-stage clustering method; 2-stage k-means algorithm; cluster location; dataset size; Algorithm design and analysis; Arrays; Clustering algorithms; Clustering methods; Convergence; Data mining; Program processors; Clustering; Data Mining; Distance Calculation; k-means algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Electronics, Control, Measurement and Signals (ECMS), 2011 10th International Workshop on

Conference_Location :

Liberec

Print_ISBN :

978-1-61284-397-1

Electronic_ISBN :

978-1-61284-396-4

Type :

conf

DOI :

10.1109/IWECMS.2011.5952377

Filename :

5952377

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2275140