DocumentCode :
1376127
Title :
A Framework for Incorporating Functional Interrelationships into Protein Function Prediction Algorithms
Author :
Zhang, Xiao-Fei ; Dai, Dao-Qing
Author_Institution :
Dept. of Math., Sun Yat-Sen Univ., Guangzhou, China
Volume :
9
Issue :
3
fYear :
2012
Firstpage :
740
Lastpage :
753
Abstract :
The functional annotation of proteins is one of the most important tasks in the post-genomic era. Although many computational approaches have been developed in recent years to predict protein function, most of these traditional algorithms do not take interrelationships among functional terms into account, such as different GO terms usually coannotate with some common proteins. In this study, we propose a new functional similarity measure in the form of Jaccard coefficient to quantify these interrelationships and also develop a framework for incorporating GO term similarity into protein function prediction process. The experimental results of cross-validation on S. cerevisiae and Homo sapiens data sets demonstrate that our method is able to improve the performance of protein function prediction. In addition, we find that small size terms associated with a few of proteins obtain more benefit than the large size ones when considering functional interrelationships. We also compare our similarity measure with other two widely used measures, and results indicate that when incorporated into function prediction algorithms, our proposed measure is more effective. Experiment results also illustrate that our algorithms outperform two previous competing algorithms, which also take functional interrelationships into account, in prediction accuracy. Finally, we show that our method is robust to annotations in the database which are not complete at present. These results give new insights about the importance of functional interrelationships in protein function prediction.
Keywords :
cellular biophysics; genomics; microorganisms; molecular biophysics; proteins; Homo sapiens data sets; Jaccard coefficient; S.cerevisiae; cerevisiae data sets; functional annotation; functional interrelationships; functional similarity measurement; post-genomic era; protein function prediction algorithms; traditional algorithms; Bioinformatics; Computational biology; Prediction algorithms; Proteins; RNA; Training; Training data; Gaussian random fields model.; Gene Ontology; Protein function prediction; protein-protein interaction; semantic similarity measure; Algorithms; Computational Biology; Databases, Protein; Humans; Protein Interaction Mapping; Proteins; Saccharomyces cerevisiae;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2011.148
Filename :
6081848
Link To Document :
بازگشت