DocumentCode :
478732
Title :
The Relationship between Protein Sequences and their Gene Ontology Functions
Author :
Duan, Zhong-Hui ; Hughes, Brent ; Reichel, Lothar ; Shi, Ting
Author_Institution :
Dept. of Comput. Sci., Akron Univ., OH
Volume :
1
fYear :
2006
fDate :
20-24 June 2006
Firstpage :
76
Lastpage :
83
Abstract :
The underlying assumption of many automated sequence annotation methods is that similar sequences imply similar biological functions. The present paper re-examines this assumption. A novel measure based on a set of local BLAST alignments is introduced to define the overall similarity between two protein sequences. The relationships between yeast protein sequences and their biological functions in the context of gene ontology categories are presented, and the effects of the level of gene ontology terms and the size of gene ontology groups on the degree of similarity are studied. The similarity distributions at different levels of gene ontology trees are considered. To evaluate the theoretical prediction power of similar sequences, we compute the posterior probability of correct predictions. The results indicate that the posterior probability can serve as an important measure for automated protein function prediction.
Keywords :
biology computing; genetics; ontologies (artificial intelligence); proteins; sequences; statistical distributions; BLAST alignment; automated protein function prediction; automated sequence annotation method; biological function; gene ontology function; probability distribution; protein sequence; Bioinformatics; Biological information theory; Biological processes; Biology; Databases; Fungi; Genomics; Ontologies; Organisms; Proteins;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Computational Sciences, 2006. IMSCCS '06. First International Multi-Symposiums on
Conference_Location :
Hangzhou, Zhejiang
Print_ISBN :
0-7695-2581-4
Type :
conf
DOI :
10.1109/IMSCCS.2006.133
Filename :
4673528