• DocumentCode
    3696010
  • Title

    A Comparative Study of Selective Cluster Ensemble for Document Clustering

  • Author

    Sen Xu;Jun Gao;Xiufang Xu;Xianfeng Li;Hualong Yu

  • Author_Institution
    Sch. of Inf. Eng., Yancheng Inst. of Technol., Yancheng, China
  • Volume
    1
  • fYear
    2015
  • Firstpage
    308
  • Lastpage
    311
  • Abstract
    Spherical K-means algorithm (SKM) has been widely used for document clustering. Yet it is prone to getting stuck at local optimums. Recently, cluster ensemble has shown to be an effective method in improving the performance of single clustering algorithm such as SKM by combining multiple clustering solutions resulted by SKM. This paper describes and compares three graph partitioning algorithms and four hierarchy clustering algorithms for document clustering, both theoretically and empirically, using a selective cluster ensemble framework. Our experimental results over several document datasets show that, in terms of normalized mutual information, (a) The hierarchy clustering algorithms produce better clustering results than graph partitioning algorithms (b) on most datasets, the best results are arrived when the size of ensemble is between 40 and 60.
  • Keywords
    "Clustering algorithms","Partitioning algorithms","Algorithm design and analysis","Chlorine","Mutual information","Classification algorithms","Image segmentation"
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Human-Machine Systems and Cybernetics (IHMSC), 2015 7th International Conference on
  • Print_ISBN
    978-1-4799-8645-3
  • Type

    conf

  • DOI
    10.1109/IHMSC.2015.268
  • Filename
    7334710