• DocumentCode
    2109745
  • Title

    SVM-based prediction of the calpain degradome using Bayes Feature Extraction

  • Author

    Wee, L.J.K. ; Low, H.M.

  • Author_Institution
    Inst. for Infocomm Res., Singapore, Singapore
  • fYear
    2012
  • fDate
    Aug. 28 2012-Sept. 1 2012
  • Firstpage
    5534
  • Lastpage
    5540
  • Abstract
    Calpains belong to a family of calcium-dependent cysteine proteases which are implicated in a myriad of pathologies such as cancer and neurodegeneration. Despite extensive experimental studies on these proteases, our knowledge of the calpain degradome is still limited. Using a dataset of 341 unique, experimentally verified calpain cleavage sites, we conducted extensive sequence analyses and discovered novel residue propensities in the region flanking the cleavage site which could be modeled for prediction using machine learning algorithms. We have developed a series of computational models incorporating support vector machines and Bayes Feature Extraction for the prediction of calpain cleavage sites. The best models achieved AROC and accuracy scores ranging from 0.79 to 0.93 and 71% to 86% respectively when tested on independent test sets. We predicted calpain cleavage sites on proteins from the receptor tyrosine kinase family and discovered potential sites of cleavage at critical regulatory domains. The results suggest a novel role of calpains as a direct regulator of receptor tyrosine kinase activity in cell survival and cell death pathways.
  • Keywords
    Bayes methods; bioinformatics; biological techniques; enzymes; feature extraction; molecular biophysics; molecular configurations; support vector machines; Bayes feature extraction; SVM based calpain degradome prediction; calcium dependent cysteine proteases; calpain cleavage sites; cancer; cell death; cell survival; computational models; machine learning algorithms; neurodegeneration; receptor tyrosine kinase activity; regulatory domains; residue propensities; sequence analyses; support vector machines; Accuracy; Amino acids; Encoding; Proteins; Substrates; Support vector machines; Training; Algorithms; Artificial Intelligence; Bayes Theorem; Calpain; Proteolysis; Support Vector Machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
  • Conference_Location
    San Diego, CA
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4119-8
  • Electronic_ISBN
    1557-170X
  • Type

    conf

  • DOI
    10.1109/EMBC.2012.6347248
  • Filename
    6347248