DocumentCode :
2109745
Title :
SVM-based prediction of the calpain degradome using Bayes Feature Extraction
Author :
Wee, L.J.K. ; Low, H.M.
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
fYear :
2012
fDate :
Aug. 28 2012-Sept. 1 2012
Firstpage :
5534
Lastpage :
5540
Abstract :
Calpains belong to a family of calcium-dependent cysteine proteases which are implicated in a myriad of pathologies such as cancer and neurodegeneration. Despite extensive experimental studies on these proteases, our knowledge of the calpain degradome is still limited. Using a dataset of 341 unique, experimentally verified calpain cleavage sites, we conducted extensive sequence analyses and discovered novel residue propensities in the region flanking the cleavage site which could be modeled for prediction using machine learning algorithms. We have developed a series of computational models incorporating support vector machines and Bayes Feature Extraction for the prediction of calpain cleavage sites. The best models achieved AROC and accuracy scores ranging from 0.79 to 0.93 and 71% to 86% respectively when tested on independent test sets. We predicted calpain cleavage sites on proteins from the receptor tyrosine kinase family and discovered potential sites of cleavage at critical regulatory domains. The results suggest a novel role of calpains as a direct regulator of receptor tyrosine kinase activity in cell survival and cell death pathways.
Keywords :
Bayes methods; bioinformatics; biological techniques; enzymes; feature extraction; molecular biophysics; molecular configurations; support vector machines; Bayes feature extraction; SVM based calpain degradome prediction; calcium dependent cysteine proteases; calpain cleavage sites; cancer; cell death; cell survival; computational models; machine learning algorithms; neurodegeneration; receptor tyrosine kinase activity; regulatory domains; residue propensities; sequence analyses; support vector machines; Accuracy; Amino acids; Encoding; Proteins; Substrates; Support vector machines; Training; Algorithms; Artificial Intelligence; Bayes Theorem; Calpain; Proteolysis; Support Vector Machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
Conference_Location :
San Diego, CA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4119-8
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/EMBC.2012.6347248
Filename :
6347248
Link To Document :
بازگشت