• DocumentCode
    3455455
  • Title

    A Combination Approach to Cluster Validation Based on Statistical Quantiles

  • Author

    Albalate, Amparo ; Suendermann, David

  • Author_Institution
    Inst. of Inf. Technol., Univ. of Ulm, Ulm, Germany
  • fYear
    2009
  • fDate
    3-5 Aug. 2009
  • Firstpage
    549
  • Lastpage
    555
  • Abstract
    In this paper, we analyse different techniques to detect the number of clusters in a dataset, also know as cluster validation techniques. We also propose a new algorithm based on the combination of several validation indexes to simultaneously validate several partitions of a dataset generated by different clustering techniques and object distances. The existing validation techniques as well as the combination algorithm have been tested on three data sets: a synthesized mixture of Gaussians data set, the NCI60 microarray data set, and the Iris data set. Evaluation results have shown the adequate performance of the proposed approach, even if the input validity scores fail to discover the true number of clusters.
  • Keywords
    pattern clustering; statistical analysis; Gaussians data set; NCI60 microarray data set; combination algorithm; dataset cluster validation technique; object distance; statistical quantile; validation index; Bioinformatics; Biology computing; Clustering algorithms; Information analysis; Information technology; Intelligent systems; Partitioning algorithms; Speech analysis; Systems biology; Testing; Cluster Validation; Quantile;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3739-9
  • Type

    conf

  • DOI
    10.1109/IJCBS.2009.116
  • Filename
    5260453