DocumentCode :
1654136
Title :
Validating Clustering for Gene Expression Data Based on Semantic Distance of Gene Ontology Terms
Author :
Wu, Feizhen ; Ma, Wenli ; Wang, Mei ; Chen, Qilong ; Zheng, Wenling
Author_Institution :
Bioelectornic Center, Shanghai Univ., Shanghai
fYear :
2008
Firstpage :
706
Lastpage :
709
Abstract :
Clustering algorithms for gene expression data attempt to partition the gene expression data into groups, which exhibits similar patterns of variation in expression level. Many clustering algorithms have been proposed, but little guidance is available to evaluate the clustering result from biological meaning. We developed a new algorithm to measure semantic distance between Gene Ontology (GO) terms. Based on this algorithm, we proposed a novel method to assess the biological predictive power of the clustering algorithms: among a cluster, the more similar the functions of genes are, the lower the semantic distance is. We applied the approach to evaluating hierarchical clustering algorithms for yeast cell and diabetes datasets, and successfully obtained the biological features of the gene clusters. We found the approach may contribute to achieve better clustering results.
Keywords :
cellular biophysics; diseases; genetics; medical computing; microorganisms; ontologies (artificial intelligence); pattern clustering; biological predictive power; diabetes dataset; gene cluster; gene expression data; gene ontology; hierarchical clustering algorithm; pattern clustering; semantic distance; yeast cell; Biological materials; Cells (biology); Clustering algorithms; Data analysis; Diabetes; Endocrine system; Fungi; Gene expression; Ontologies; Partitioning algorithms;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedical Engineering, 2008. ICBBE 2008. The 2nd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-1747-6
Electronic_ISBN :
978-1-4244-1748-3
Type :
conf
DOI :
10.1109/ICBBE.2008.172
Filename :
4535052
Link To Document :
بازگشت