DocumentCode
2307800
Title
Initialization of cluster refinement algorithms: a review and comparative study
Author
He, Ji ; Lan, Man ; Tan, Chew-Lim ; Sung, Sam-Yuan ; Low, Hwee-Boon
Author_Institution
Sch. of Comput., Nat. Univ. of Singapore, Singapore
Volume
1
fYear
2004
fDate
25-29 July 2004
Lastpage
302
Abstract
Various iterative refinement clustering methods are dependent on the initial state of the model and are capable of obtaining one of their local optima only. Since the task of identifying the global optimization is NP-hard, the study of the initialization method towards a sub-optimization is of great value. This paper reviews the various cluster initialization methods in the literature by categorizing them into three major families, namely random sampling methods, distance optimization methods, and density estimation methods. In addition, using a set of quantitative measures, we assess their performance on a number of synthetic and real-life data sets. Our controlled benchmark identifies two distance optimization methods, namely SCS and KKZ, as complements of the k-means learning characteristics towards a better cluster separation in the output solution.
Keywords
data mining; learning (artificial intelligence); optimisation; pattern clustering; cluster initialization methods; cluster refinement algorithm; density estimation methods; distance optimization methods; k-means learning characteristics; random sampling methods; Clustering algorithms; Clustering methods; Data mining; Image segmentation; Iterative algorithms; Iterative methods; Optimization methods; Sampling methods; Sun; Unsupervised learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1379917
Filename
1379917
Link To Document