Title :
Sequence-Based Prediction of Protein Folding Rates Using Contacts, Secondary Structures and Support Vector Machines
Author :
Lin, Guan Ning ; Wang, Zheng ; Xu, Dong ; Cheng, Jianlin
Author_Institution :
Inf. Inst., Univ. of Missouri, Columbia, MO, USA
Abstract :
Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic natures (two-state folding and multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using features extracted from only protein sequence with support vector machines. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Both the Web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html.
Keywords :
bioinformatics; feature extraction; file servers; molecular biophysics; molecular configurations; proteins; support vector machines; Pearson correlation coefficient; SeqRate; Web server; feature extraction; folding kinetic type classification; mean absolute difference; protein folding rates; secondary structures; sequence-based prediction; support vector machines; Bioinformatics; Biomedical informatics; Computer science; Databases; Kinetic theory; Protein engineering; Protein sequence; Support vector machine classification; Support vector machines; Topology; classifcation; folding rate; folding type; support vector machine;
Conference_Titel :
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-0-7695-3885-3
DOI :
10.1109/BIBM.2009.21