DocumentCode
2955003
Title
Gene Ontology term prediction based upon amino acid occurrence
Author
Taguchi, Y.-H. ; Gromiha, M. Michael
Author_Institution
Dept. of Phys., Chun Univ., Tokyo
fYear
2008
fDate
1-8 June 2008
Firstpage
615
Lastpage
620
Abstract
Usually prediction of molecular functions of proteins from their amino acid sequences is based upon sequence similarity with proteins of known functions. However, it is well known that function is mainly dependent upon protein structures than sequences. Since structures are often independent of sequences, it is important to predict function without sequence similarities. Here we propose a method based upon amino acid occurrence for predicting Gene Ontology (GO) term. We have tested the method in a set of 3212 proteins in Protein Data Bank with less than 40% sequence identity. Our method achieved more than 50% sensitivity and 20% precision for c.a. 20 selected GO terms among the most frequent 557 GO terms. Mean sensitivity, specificity, precision, and accuracy for relatively rare (but majority) 402 GO terms among the 557 GO terms are 13%, 99%, 9% and 99%, respectively. They are significantly larger than expected values of less than 2% under assuming random selection.
Keywords
biology computing; molecular biophysics; ontologies (artificial intelligence); proteins; sequences; amino acid occurrence; gene ontology term prediction; protein molecular function; protein structure; sequence similarity; Amino acids; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location
Hong Kong
ISSN
1098-7576
Print_ISBN
978-1-4244-1820-6
Electronic_ISBN
1098-7576
Type
conf
DOI
10.1109/IJCNN.2008.4633857
Filename
4633857
Link To Document