DocumentCode :
2963410
Title :
Predicting signal peptide and its cleavage site by using GA-optimized position weight matrices
Author :
Lao, Demelo M. ; Avila, J.M.C.
Author_Institution :
Dept. of Comput. Sci., Univ. of the Philippines Cebu Coll., Cebu, Philippines
fYear :
2012
fDate :
19-22 Nov. 2012
Firstpage :
1
Lastpage :
6
Abstract :
Signal peptide and cleavage site prediction are important problems in bioinformatics because of their contributions to modern cell biology research, the study of molecular mechanisms of disease, and drug discovery. In this paper, we present results for signal peptide and cleavage site prediction using the weight matrix approach with genetic algorithm (GA)-optimized position weight matrix (PWM) profiles, one each for eukaryotic, gram-negative prokaryotic, and gram-positive prokaryotic organisms. Consistency tests yielded overall performance ratings of roughly 97% for signal peptide prediction and approximately 77% for cleavage site prediction at position 0. Cross-validation showed that the GA-optimized profile matrices predicted the presence of signal peptides with an overall accuracy of around 95%. For cleavage site prediction, however, the three optimized profile matrices achieved an overall accuracy of about 72%-74% in predicting the actual cleavage site location. For prokaryotic protein sequences not labeled as gram-negative or gram-positive, the GA-optimized PWM profile for gram-negative organisms consistently gave higher success rates in predicting the correct cleavage site location. A comparison with the latest existing profile matrices used for signal peptide and cleavage site prediction showed only a slight improvement in overall performance. Although the improvement is small, it makes a considerable difference when analyzing large datasets or genomic protein sequences.
Keywords :
bioinformatics; genetic algorithms; genetics; genomics; microorganisms; molecular biophysics; molecular configurations; proteins; bioinformatics; cleavage site predictions; diseases; drug discovery; eukaryotic organisms; genetic algorithm-optimized position weight matrices; genomic protein sequences; gram-negative organisms; gram-positive prokaryotic organisms; modern cell biological research; molecular mechanisms; signal peptide; Amino acids; Organisms; Peptides; Protein sequence; Pulse width modulation; Sociology; GA optimization; PWM profile; cleavage site prediction; signal peptide; weight matrix method;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
TENCON 2012 - 2012 IEEE Region 10 Conference
Conference_Location :
Cebu
ISSN :
2159-3442
Print_ISBN :
978-1-4673-4823-2
Electronic_ISBN :
2159-3442
Type :
conf
DOI :
10.1109/TENCON.2012.6412173
Filename :
6412173