Title :
Mitochondrial protein cleavage site predictor
Author :
Ukasawa, Yoshinori F. ; Wan, Raymond ; Fu, Szu-Chin ; Tsuji, Junko ; Sakiyama, Noriyuki ; Imai, Kenichiro ; Horton, Paul
Author_Institution :
Grad. Sch. of Frontier Sci., Univ. of Tokyo, Tokyo, Japan
Abstract :
Summary form only given. We have constructed a new tool to predict the cleavage sites of mitochondrial proteins. Trained on newer data, in preliminary results our tool compares favorably with the industry standard tool TargetP. In most eukaryotic cells, over 1000 nuclear-encoded proteins are translocated into the mitochondria to perform their function. Roughly half of these have an N-terminal sorting signal known as the MTS (Matrix Targeting Signal) and most of those are cleaved after mitochondrial import by MPP (Mitochondrial Protein Peptidase) and other peptidases. The position of these cleavage sites is necessary to know the amino acid sequence of the mature protein. TargetP is a classic tool for predicting MPP cleavage sites. However, recently large-scale experiments in yeast gave us an opportunity to create a new classifier based on much more data. Ultimately we hope to predict all of the cleavage events experienced by proteins as they mature in the mitochondria, but in our poster we will describe the framework shown in figure 2, encompassing the peptidases MPP, Octl, and Icp55. MPP typically cleaves one residue after an arginine. Subsequently, some proteins are further cleaved by Octl (cleaving eight residues) or by Icp55 (cleaving one residue). Figure 1 shows the sequence logos for the inferred cleavage sites of each peptidase and an extra group of currently unexplained data. As in Vogtle et al, we have inferred some intermediate sites and the identity of the peptidases based on the position of arginines. Currently we have had the most success with predicting MPP sites. Our first attempt uses a profile HMM (Hidden Markov Model) as implemented by the HMMER2 package (hmmer.janelia.org/). Preliminary cross-validation results suggest that our new predictor outperforms TargetP for this task. We are also exploring other techniques including SVM classifiers and Dynamic Bayesian networks.
Keywords :
biology computing; cellular biophysics; hidden Markov models; proteins; HMMER2 package; Icp55; MPP; MTS; N-terminal sorting signal; Octl; SVM classifiers; TargetP; amino acid sequence; arginine; dynamic Bayesian networks; eukaryotic cells; hidden Markov model; industry standard tool; matrix targeting signal; mitochondrial protein cleavage site predictor; mitochondrial protein peptidase; nuclear-encoded proteins; profile HMM; yeast;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2010 IEEE International Conference on
Conference_Location :
Hong, Kong
Print_ISBN :
978-1-4244-8303-7
Electronic_ISBN :
978-1-4244-8304-4
DOI :
10.1109/BIBMW.2010.5703951