DocumentCode
3265384
Title
Predicting Peroxisomal Proteins
Author
Hawkins, John ; Bodén, Mikael
Author_Institution
School of Information Technology and Electrical Engineering The University of Queensland QLD 4072, Australia, Email: jhawkins@itee.uq.edu.au
fYear
2005
fDate
14-15 Nov. 2005
Firstpage
1
Lastpage
6
Abstract
PTS1 proteins are peroxisomal matrix proteins that have a well conserved targeting motif at the C-terminal end. However, this motif is present in many non peroxisomal proteins as well, thus predicting peroxisomal proteins involves differentiating fake PTS1 signals from actual ones. In this paper we report on the development of an SVM classifier with a separately trained logistic output function. The model uses an input window containing 12 consecutive residues at the C-terminus and the amino acid composition of the full sequence. The final model gives a Matthews Correlation Coefficient of 0.77, representing an increase of 54% compared with the well-known PeroxiP predictor. We test the model by applying it to several proteomes of eukaryotes for which there is no evidence of a peroxisome, producing a false positive rate of 0.088%.
Keywords
Accuracy; Amino acids; Biomembranes; Logistics; Machine learning; Predictive models; Protein engineering; Support vector machine classification; Support vector machines; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence in Bioinformatics and Computational Biology, 2005. CIBCB '05. Proceedings of the 2005 IEEE Symposium on
Print_ISBN
0-7803-9387-2
Type
conf
DOI
10.1109/CIBCB.2005.1594956
Filename
1594956
Link To Document