DocumentCode
2844974
Title
Comparison of four similarity measures based on GO annotations for Gene Clustering
Author
Al Mubaid, H. ; Nagar, Anurag
Author_Institution
Univ. of Houston-Clear Lake, Houston, TX
fYear
2008
fDate
6-9 July 2008
Firstpage
531
Lastpage
536
Abstract
Gene ontology (GO) has fast become a dependable source for determining gene functions, gene similarity, and gene clustering. Furthermore, using GO and gene annotation databases, with semantic similarity measures, is now more acceptable in bioinformatics as means for gene functional analysis. In this paper, we compare four semantic similarity measures to compute the similarity between genes using GO annotations within a gene clustering application. The similarity measures we examined in this study rely on different information sources and techniques in computing the similarity between genes. For example, we selected two measures that are based on information-content, one measure is based on ontology-structure, and the fourth measure is based on annotation terms within the GO hierarchy. In the evaluation, we used five gene datasets from yeast genome, and we analyzed the results based on various clustering metrics and criteria.
Keywords
biology computing; database management systems; genetics; ontologies (artificial intelligence); pattern clustering; bioinformatics; clustering metrics; gene annotation databases; gene clustering; gene functional analysis; gene ontology annotations; gene similarity; semantic similarity measures; yeast genome; Bioinformatics; Databases; Functional analysis; Fungi; Genomics; Lakes; Length measurement; Ontologies; Proteins; Time measurement; GO clustering; Gene clustering; gene similarity;
fLanguage
English
Publisher
ieee
Conference_Titel
Computers and Communications, 2008. ISCC 2008. IEEE Symposium on
Conference_Location
Marrakech
ISSN
1530-1346
Print_ISBN
978-1-4244-2702-4
Electronic_ISBN
1530-1346
Type
conf
DOI
10.1109/ISCC.2008.4625763
Filename
4625763
Link To Document