Title of article
Prediction of protein submitochondria locations based on data fusion of various features of sequences
Author/Authors
Zakeri، نويسنده , , Pooya and Moshiri، نويسنده , , Behzad and Sadeghi، نويسنده , , Mehdi، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2011
Pages
9
From page
208
To page
216
Abstract
In this study, the predictors are developed for protein submitochondria locations based on various features of sequences. Information about the submitochondria location for a mitochondria protein can provide much better understanding about its function. We use ten representative models of protein samples such as pseudo amino acid composition, dipeptide composition, functional domain composition, the combining discrete model based on prediction of solvent accessibility and secondary structure elements, the discrete model of pairwise sequence similarity, etc. We construct a predictor based on support vector machines (SVMs) for each representative model. The overall prediction accuracy by the leave-one-out cross validation test obtained by the predictor which is based on the discrete model of pairwise sequence similarity is 1% better than the best computational system that exists for this problem. Moreover, we develop a method based on ordered weighted averaging (OWA) which is one of the fusion data operators. Therefore, OWA is applied on the 11 best SVM-based classifiers that are constructed based on various features of sequence. This method is called Mito-Loc. The overall leave-one-out cross validation accuracy obtained by Mito-Loc is about 95%. This indicates that our proposed approach (Mito-Loc) is superior to the result of the best existing approach which has already been reported.
Keywords
Ordered Weighted Averaging , Subcellular Localization , Support Vector Machines , Pairwise sequence similarity
Journal title
Journal of Theoretical Biology
Serial Year
2011
Journal title
Journal of Theoretical Biology
Record number
1540455
Link To Document