• DocumentCode
    1340733
  • Title

    DICLENS: Divisive Clustering Ensemble with Automatic Cluster Number

  • Author

    Mimaroglu, S. ; Aksehirli, E.

  • Author_Institution
    Dept. of Comput. Eng., Bahcesehir Univ., Istanbul, Turkey
  • Volume
    9
  • Issue
    2
  • fYear
    2012
  • Firstpage
    408
  • Lastpage
    420
  • Abstract
    Clustering has a long and rich history in a variety of scientific fields. Finding natural groupings of a data set is a hard task as attested by hundreds of clustering algorithms in the literature. Each clustering technique makes some assumptions about the underlying data set. If the assumptions hold, good clusterings can be expected. It is hard, in some cases impossible, to satisfy all the assumptions. Therefore, it is beneficial to apply different clustering methods on the same data set, or the same method with varying input parameters or both. We propose a novel method, DICLENS, which combines a set of clusterings into a final clustering having better overall quality. Our method produces the final clustering automatically and does not take any input parameters, a feature missing in many existing algorithms. Extensive experimental studies on real, artificial, and gene expression data sets demonstrate that DICLENS produces very good quality clusterings in a short amount of time. DICLENS implementation runs on standard personal computers by being scalable, and by consuming very little memory and CPU.
  • Keywords
    genetics; medical computing; pattern clustering; tree data structures; DICLENS; divisive clustering ensemble with automatic cluster number; gene expression data; gene expressions; minimum spanning tree; Bioinformatics; Clustering algorithms; Clustering methods; Computational biology; Gene expression; Partitioning algorithms; Software algorithms; Clustering; cluster ensembles; combining multiple clusterings; consensus clustering; evidence accumulation; gene expressions.; minimum spanning tree; Algorithms; Cluster Analysis; Computational Biology; Databases, Genetic; Gene Expression Profiling; Humans; Neoplasms;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2011.129
  • Filename
    6035671