DocumentCode
478732
Title
The Relationship between Protein Sequences and their Gene Ontology Functions
Author
Duan, Zhong-Hui ; Hughes, Brent ; Reichel, Lothar ; Shi, Ting
Author_Institution
Dept. of Comput. Sci., Akron Univ., OH
Volume
1
fYear
2006
fDate
20-24 June 2006
Firstpage
76
Lastpage
83
Abstract
The underlying assumption of many automated sequence annotation methods is that similar sequences imply similar biological functions. The present paper re-examines this assumption. A novel measure based on a set of local BLAST alignments is introduced to define the overall similarity between two protein sequences. The relationships between yeast protein sequences and their biological functions in the context of gene ontology categories are presented, and the effects of the level of gene ontology terms and the size of gene ontology groups on the degree of similarity are studied. The similarity distributions at different levels of gene ontology trees are considered. To evaluate the theoretical prediction power of similar sequences, we compute the posterior probability of correct predictions. The results indicate that the posterior probability can serve as an important measure for automated protein function prediction.
Keywords
biology computing; genetics; ontologies (artificial intelligence); proteins; sequences; statistical distributions; BLAST alignment; automated protein function prediction; automated sequence annotation method; biological function; gene ontology function; probability distribution; protein sequence; Bioinformatics; Biological information theory; Biological processes; Biology; Databases; Fungi; Genomics; Ontologies; Organisms; Proteins
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
Conference_Location
Hangzhou, Zhejiang
Print_ISBN
0-7695-2581-4
Type
conf
DOI
10.1109/IMSCCS.2006.133
Filename
4673528