Title :
Prediction of Protein Sub-Cellular Localization through Weighted Combination of Classifiers
Author :
Fayyaz, M. ; Mujahid, A. ; Khan, A. ; Bangash, A.
Author_Institution :
Ghulam Ishaq Khan (GIK) Inst. of Eng. Sci. & Technol., Swabi
Abstract :
Prediction of subcellular localization of proteins is an important step in genome annotation and in search for achieving novel drug targets. Conducting experiments for extracting information about protein sub cellular localization is both time consuming and costly effort. Machine learning approaches, especially, ensemble of classifiers, providing efficient and reliable mechanism of computational prediction are thus highly desired In this context, we propose a modification to the approach proposed in [K. C. Chou, J. Cell. Biol. 99(2006)517]. We have used a weighted polling method to fuse the output of individual covariant discriminant classifiers. The individual classifiers are trained on features based on pseudo-amino add composition of proteins. Three methods of verifications; re-substitution, Jackknife, and independent data set tests have been employed and give over all accuracies of 87.13%, 71.15% and 74.90% respectively. The predicted accuracies are higher than that of the existing schemes.
Keywords :
biochemistry; biology computing; cellular biophysics; molecular biophysics; pattern classification; proteins; Jackknife test; independent data set test; individual covariant discriminant classifiers; information extraction; protein sub-cellular localization prediction; pseudo-amino add composition; re-substitution test; verification test; weighted polling method; Amino acids; Cells (biology); Computer science; Data mining; Drugs; Mechatronics; Organisms; Protein engineering; Sequences; Testing;
Conference_Titel :
Electrical Engineering, 2007. ICEE '07. International Conference on
Conference_Location :
Lahore
Print_ISBN :
1-4244-0893-8
Electronic_ISBN :
1-4244-0893-8
DOI :
10.1109/ICEE.2007.4287289