• DocumentCode
    2307800
  • Title

    Initialization of cluster refinement algorithms: a review and comparative study

  • Author

    He, Ji ; Lan, Man ; Tan, Chew-Lim ; Sung, Sam-Yuan ; Low, Hwee-Boon

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore
  • Volume
    1
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Lastpage
    302
  • Abstract
    Various iterative refinement clustering methods are dependent on the initial state of the model and are capable of obtaining one of their local optima only. Since the task of identifying the global optimization is NP-hard, the study of the initialization method towards a sub-optimization is of great value. This paper reviews the various cluster initialization methods in the literature by categorizing them into three major families, namely random sampling methods, distance optimization methods, and density estimation methods. In addition, using a set of quantitative measures, we assess their performance on a number of synthetic and real-life data sets. Our controlled benchmark identifies two distance optimization methods, namely SCS and KKZ, as complements of the k-means learning characteristics towards a better cluster separation in the output solution.
  • Keywords
    data mining; learning (artificial intelligence); optimisation; pattern clustering; cluster initialization methods; cluster refinement algorithm; density estimation methods; distance optimization methods; k-means learning characteristics; random sampling methods; Clustering algorithms; Clustering methods; Data mining; Image segmentation; Iterative algorithms; Iterative methods; Optimization methods; Sampling methods; Sun; Unsupervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1379917
  • Filename
    1379917