• DocumentCode
    2151992
  • Title

    A mutual information based approach for evaluating the quality of clustering

  • Author

    Fattah, S.A. ; Lin, Chia-Chun ; Kung, Sun-Yuan

  • Author_Institution
    Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    601
  • Lastpage
    604
  • Abstract
    In this paper, a new method for evaluating the quality of clustering of genes is proposed based on mutual information criterion. Instead of using the conventional histogram-based modeling method to assess clustering performance, we derive a normalized mutual information criterion utilizing the Gaussian kernel density estimator. In the computation of the mutual information, we propose to use only cluster-centroids instead of involving all the members, which offers a huge computational savings. The proposed algorithm not only considers the cluster size but also takes into consideration the homogeneity within a cluster. One major advantage of the proposed algorithm is that, it is capable of estimating an appropriate number of clusters. Extensive experimentation has been carried out on some synthetic data as well as the most widely used Yeast cell cycle gene expression data. Under various clustering conditions it is found that the proposed method provides an excellent performance in terms of measuring the quality of cluster and identifying the true number of cluster.
  • Keywords
    Gaussian processes; bioinformatics; pattern clustering; Gaussian kernel density estimator; Yeast cell cycle gene expression data; clustering quality evaluation; gene clustering; histogram-based modeling method; mutual information criterion approach; synthetic data; Classification algorithms; Clustering algorithms; Estimation; Gene expression; Kernel; Mutual information; Random variables; Mutual information; clustering; gene classification; kernel density estimator; microarray gene expression data; probability density function;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5946475
  • Filename
    5946475