DocumentCode
3219916
Title
Prediction of Protein Sub-Cellular Localization through Weighted Combination of Classifiers
Author
Fayyaz, M. ; Mujahid, A. ; Khan, A. ; Bangash, A.
Author_Institution
Ghulam Ishaq Khan (GIK) Inst. of Eng. Sci. & Technol., Swabi
fYear
2007
fDate
11-12 April 2007
Firstpage
1
Lastpage
6
Abstract
Prediction of subcellular localization of proteins is an important step in genome annotation and in search for achieving novel drug targets. Conducting experiments for extracting information about protein sub cellular localization is both time consuming and costly effort. Machine learning approaches, especially, ensemble of classifiers, providing efficient and reliable mechanism of computational prediction are thus highly desired In this context, we propose a modification to the approach proposed in [K. C. Chou, J. Cell. Biol. 99(2006)517]. We have used a weighted polling method to fuse the output of individual covariant discriminant classifiers. The individual classifiers are trained on features based on pseudo-amino add composition of proteins. Three methods of verifications; re-substitution, Jackknife, and independent data set tests have been employed and give over all accuracies of 87.13%, 71.15% and 74.90% respectively. The predicted accuracies are higher than that of the existing schemes.
Keywords
biochemistry; biology computing; cellular biophysics; molecular biophysics; pattern classification; proteins; Jackknife test; independent data set test; individual covariant discriminant classifiers; information extraction; protein sub-cellular localization prediction; pseudo-amino add composition; re-substitution test; verification test; weighted polling method; Amino acids; Cells (biology); Computer science; Data mining; Drugs; Mechatronics; Organisms; Protein engineering; Sequences; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical Engineering, 2007. ICEE '07. International Conference on
Conference_Location
Lahore
Print_ISBN
1-4244-0893-8
Electronic_ISBN
1-4244-0893-8
Type
conf
DOI
10.1109/ICEE.2007.4287289
Filename
4287289
Link To Document