• DocumentCode
    2516211
  • Title

    Sequence-Based Prediction of Protein Folding Rates Using Contacts, Secondary Structures and Support Vector Machines

  • Author

    Lin, Guan Ning ; Wang, Zheng ; Xu, Dong ; Cheng, Jianlin

  • Author_Institution
    Inf. Inst., Univ. of Missouri, Columbia, MO, USA
  • fYear
    2009
  • fDate
    1-4 Nov. 2009
  • Firstpage
    3
  • Lastpage
    8
  • Abstract
    Predicting protein folding rate is useful for understanding protein folding process and guiding protein design. Most previous methods of predicting folding rate require the tertiary structure of a protein as an input. And most methods do not distinguish the different kinetic natures (two-state folding and multi-state folding) of the proteins. Here we developed a method, SeqRate, to predict both protein folding kinetic type (two-state versus multi-state) and real-value folding rate using features extracted from only protein sequence with support vector machines. On a standard benchmark dataset, the accuracy of folding kinetic type classification is 80%. The Pearson correlation coefficient and the mean absolute difference between predicted and experimental folding rates (sec-1) in the base-10 logarithmic scale are 0.81 and 0.79 for two-state protein folders, and 0.80 and 0.68 for three-state protein folders. SeqRate is the first sequence-based method for protein folding type classification and its accuracy of fold rate prediction is improved over previous sequence-based methods. Both the Web server and software of predicting folding rate are publicly available at http://casp.rnet.missouri.edu/fold_rate/index.html.
  • Keywords
    bioinformatics; feature extraction; file servers; molecular biophysics; molecular configurations; proteins; support vector machines; Pearson correlation coefficient; SeqRate; Web server; feature extraction; folding kinetic type classification; mean absolute difference; protein folding rates; secondary structures; sequence-based prediction; support vector machines; Bioinformatics; Biomedical informatics; Computer science; Databases; Kinetic theory; Protein engineering; Protein sequence; Support vector machine classification; Support vector machines; Topology; classifcation; folding rate; folding type; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-0-7695-3885-3
  • Type

    conf

  • DOI
    10.1109/BIBM.2009.21
  • Filename
    5341882