• DocumentCode
    73843
  • Title

    K-Means-Based Consensus Clustering: A Unified View

  • Author

    Junjie Wu ; Hongfu Liu ; Hui Xiong ; Jie Cao ; Jian Chen

  • Author_Institution
    Dept. of Inf. Syst., Beihang Univ., Beijing, China
  • Volume
    27
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 1 2015
  • Firstpage
    155
  • Lastpage
    169
  • Abstract
    The objective of consensus clustering is to find a single partitioning which agrees as much as possible with existing basic partitionings. Consensus clustering emerges as a promising solution to find cluster structures from heterogeneous data. As an efficient approach for consensus clustering, the K-means based method has garnered attention in the literature, however the existing research efforts are still preliminary and fragmented. To that end, in this paper, we provide a systematic study of K-means-based consensus clustering (KCC). Specifically, we first reveal a necessary and sufficient condition for utility functions which work for KCC. This helps to establish a unified framework for KCC on both complete and incomplete data sets. Also, we investigate some important factors, such as the quality and diversity of basic partitionings, which may affect the performances of KCC. Experimental results on various realworld data sets demonstrate that KCC is highly efficient and is comparable to the state-of-the-art methods in terms of clustering quality. In addition, KCC shows high robustness to incomplete basic partitionings with many missing values.
  • Keywords
    pattern clustering; KCC; cluster structures; complete data sets; heterogeneous data; incomplete data sets; k-means-based consensus clustering; partitioning diversity; partitioning quality; utility functions; Clustering algorithms; Convex functions; Educational institutions; Linear programming; Partitioning algorithms; Robustness; Vectors; Consensus clustering; K-means; utility function;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2014.2316512
  • Filename
    6786489