DocumentCode :
2151992
Title :
A mutual information based approach for evaluating the quality of clustering
Author :
Fattah, S.A. ; Lin, Chia-Chun ; Kung, Sun-Yuan
Author_Institution :
Dept. of Electr. Eng., Princeton Univ., Princeton, NJ, USA
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
601
Lastpage :
604
Abstract :
In this paper, a new method for evaluating the quality of clustering of genes is proposed based on mutual information criterion. Instead of using the conventional histogram-based modeling method to assess clustering performance, we derive a normalized mutual information criterion utilizing the Gaussian kernel density estimator. In the computation of the mutual information, we propose to use only cluster-centroids instead of involving all the members, which offers a huge computational savings. The proposed algorithm not only considers the cluster size but also takes into consideration the homogeneity within a cluster. One major advantage of the proposed algorithm is that, it is capable of estimating an appropriate number of clusters. Extensive experimentation has been carried out on some synthetic data as well as the most widely used Yeast cell cycle gene expression data. Under various clustering conditions it is found that the proposed method provides an excellent performance in terms of measuring the quality of cluster and identifying the true number of cluster.
Keywords :
Gaussian processes; bioinformatics; pattern clustering; Gaussian kernel density estimator; Yeast cell cycle gene expression data; clustering quality evaluation; gene clustering; histogram-based modeling method; mutual information criterion approach; synthetic data; Classification algorithms; Clustering algorithms; Estimation; Gene expression; Kernel; Mutual information; Random variables; Mutual information; clustering; gene classification; kernel density estimator; microarray gene expression data; probability density function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946475
Filename :
5946475
Link To Document :
بازگشت