• DocumentCode
    3126162
  • Title

    Learning Protein Folding Energy Functions

  • Author

    Guan, Wei ; Ozakin, Arkadas ; Gray, Alexander ; Borreguero, Jose ; Pandit, Shashi ; Jagielska, Anna ; Wroblewska, Liliana ; Sko, Jeffrey

  • Author_Institution
    Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    1062
  • Lastpage
    1067
  • Abstract
    A critical open problem in emph{ab initio} protein folding is protein energy function design, which pertains to defining the energy of protein conformations in a way that makes folding most efficient and reliable. In this paper, we address this issue as a weight optimization problem and utilize a machine learning approach, learning-to-rank, to solve this problem. We investigate the ranking-via-classification approach, especially the Ranking SVM method and compare it with the state-of-the-art approach to the problem using the MINUIT optimization package. To maintain the physicality of the results, we impose non-negativity constraints on the weights. For this we develop two efficient non-negative support vector machine (NNSVM) methods, derived from L2-norm SVM and L1-norm SVMs, respectively. We demonstrate an energy function which maintains the correct ordering with respect to structure dissimilarity to the native state more often, is more efficient and reliable for learning on large protein sets, and is qualitatively superior to the current state-of-the-art energy function.
  • Keywords
    biology computing; optimisation; support vector machines; MINUIT optimization package; NNSVM; learning protein folding energy functions; machine learning; nonnegative support vector machine; optimization problem; protein conformations; ranking SVM method; Correlation; Optimization; Proteins; Reliability; Support vector machines; Vectors; emph{ab initio} protein folding; energy function; learning-to-rank; non-negativity constrained SVM optimization; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2011 IEEE 11th International Conference on
  • Conference_Location
    Vancouver,BC
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4577-2075-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2011.88
  • Filename
    6137315