Title of article :
Predicting rRNA-, RNA-, and DNA-binding proteins from primary structure with support vector machines
Author/Authors :
Yu، نويسنده , , Xiaojing and Cao، نويسنده , , Jianping and Cai، نويسنده , , Yudong and Shi، نويسنده , , Tieliu and Li، نويسنده , , Yixue، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2006
Pages :
10
From page :
175
To page :
184
Abstract :
In the post-genome era, the prediction of protein function is one of the most demanding tasks in the study of bioinformatics. Machine learning methods, such as the support vector machines (SVMs), greatly help to improve the classification of protein function. s work, we integrated SVMs, protein sequence amino acid composition, and associated physicochemical properties into the study of nucleic-acid-binding proteins prediction. We developed the binary classifications for rRNA-, RNA-, DNA-binding proteins that play an important role in the control of many cell processes. Each SVM predicts whether a protein belongs to rRNA-, RNA-, or DNA-binding protein class. Self-consistency and jackknife tests were performed on the protein data sets in which the sequences identity was <25%. Test results show that the accuracies of rRNA-, RNA-, DNA-binding SVMs predictions are ∼84%, ∼78%, ∼72%, respectively. The predictions were also performed on the ambiguous and negative data set. The results demonstrate that the predicted scores of proteins in the ambiguous data set by RNA- and DNA-binding SVM models were distributed around zero, while most proteins in the negative data set were predicted as negative scores by all three SVMs. The score distributions agree well with the prior knowledge of those proteins and show the effectiveness of sequence associated physicochemical properties in the protein function prediction. The software is available from the author upon request.
Keywords :
RNA-binding protein , DNA-binding protein , protein function prediction , Support vector machines (SVMs) , rRNA-binding protein
Journal title :
Journal of Theoretical Biology
Serial Year :
2006
Journal title :
Journal of Theoretical Biology
Record number :
1537621
Link To Document :
بازگشت