DocumentCode :
2779014
Title :
Automatic estimation total number of cluster using a hybrid test-and-generate and K-means algorithm
Author :
Mahmuddin, M. ; Yusof, Y.
Author_Institution :
Coll. of Arts & Sci., Univ. Utara Malaysia, Sintok, Malaysia
fYear :
2010
fDate :
5-8 Dec. 2010
Firstpage :
593
Lastpage :
596
Abstract :
K-mean algorithm requires total number cluster, k beforehand in order the algorithm operates correctly. This pre-requisite value is needed to ensure the algorithm works on the tested data. In this paper, a test-and-generate approach is applied to estimate total number present in a data. A hybrid Bees Algorithm and cluster validity index are used for this purpose. The modified Bees algorithm is used to find near-optimal cluster centres (centroids) whereas cluster validity index is employed to examine `goodness´ of the generated clusters. A series of experiments using some benchmarking data sets are undertaken to evaluate effectiveness of the proposed approach. A promising results show that the proposed approach is capable to estimate total number of cluster in a data.
Keywords :
pattern clustering; hybrid bees algorithm; k-means algorithm; test and generate algorithm; total cluster number estimation; Clustering algorithms; Educational institutions; Indexes; Machine learning; Machine learning algorithms; Noise measurement; Sun; Cluster validity index; Clustering; K-means; Total number of cluster;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Applications and Industrial Electronics (ICCAIE), 2010 International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-9054-7
Type :
conf
DOI :
10.1109/ICCAIE.2010.5735150
Filename :
5735150
Link To Document :
بازگشت