Title :
Predicting Signal Peptides and Their Cleavage Sites Using Support Vector Machines and Improved Position Weight Matrixes
Author :
Sun, Jingjing ; Wang, Lipo
Author_Institution :
Coll. of Inf. Eng., Xiangtan Univ., Xiangtan
Abstract :
In this paper, we develop a method for predicting signal peptides and their cleavage sites. Unlike other published work, we divide proteins into two segments and calculate the amino acid compositions on both segments. After that, we hybridize the pseudo amino acid compositions (PseAAs) to the feature vectors. Using support vector machines (SVMs) to train the datasets, we get better results than those with the optimized evidence-theoretic K nearest neighbor (OET-KNN) classifier. The overall rate of correct prediction for signal peptides is over 97%. For identifying cleavage sites, we use the scaled window proposed by Chou to extract cleavable secretory segments and non-cleavable secretory segments and improve the position weight matrix (PWM) method proposed by Hiller et al.. By hybridizing the scaled window and PWM methods, the correct prediction for signal peptides cleavage sites is also better or comparable to other methods.
Keywords :
biology computing; matrix algebra; molecular biophysics; proteins; support vector machines; SVM; cleavable secretory segment; cleavage site; position weight matrix method; proteins; pseudo amino acid composition; signal peptide; support vector machine; Amino acids; Educational institutions; Peptides; Protein engineering; Pulse width modulation; Sequences; Signal processing; Sun; Support vector machine classification; Support vector machines; Signal peptide; position weight matrix; pseudo amino acid composition; support vector machine;
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
DOI :
10.1109/ICNC.2008.406