DocumentCode
1930491
Title
Predicting Protein Subcellular Localizations for Gram-Negative Bacteria Using DP-PSSM and Support Vector Machines
Author
Juan, Eric Y T ; Li, W.J. ; Jhang, J.H. ; Chiu, C.H.
Author_Institution
Dept. of Comput. Sci. & Eng., Nat. Taiwan Ocean Univ., Keelung
fYear
2009
fDate
16-19 March 2009
Firstpage
836
Lastpage
841
Abstract
Invisible bacteria are found almost everywhere, and having a great impact on our everyday life. Particularly, many species of gram-negative bacteria are pathogenic and cause a wide variety of diseases in humans and animals. It is crucial in drug design to cure diseases brought by gram-negative bacteria. Unfortunately, a new drug discovery can be expensive and time-consuming even with the advance of biotechnology. Designing a highly effective and efficient computational system, especially for identifying protein subcellular localization for gram-negative bacteria, is an important research field.In this paper, we propose a new computational system which combines a well-known classifier, support vector machines (SVMs), a protein descriptor, DP-PSSM (Directional Property-PSSM), and an optimal tool for system tuning. In addition, an evolutionary computation based feature selection technique is applied to further improve the performance of our computational system. Our computational system, EF-SVM-PSL, had been tested through 10 fold cross validation on predicting subcellular localizations of three gram-negative bacteria protein datasets, PS1444, NR828, and EV243. Our EF-SVM-PSL has a relative simple architecture and performs competitively with the best alternative systems.
Keywords
biotechnology; evolutionary computation; microorganisms; proteins; support vector machines; biotechnology; drug design; drug discovery; evolutionary computation; feature selection technique; gram-negative bacteria; invisible bacteria; protein datasets; protein descriptor; protein subcellular localizations; support vector machines; Animals; Biotechnology; Diseases; Drugs; Humans; Microorganisms; Pathogens; Proteins; Support vector machine classification; Support vector machines; (Protein Subcellular Localization); DP-PSSM(Directional Property-PSSM); EF-SVM-PSL (Evolutionary Feature selection-SVM-PSL ); SVMs (support vector machines); and feature selection; evolutionary computation;
fLanguage
English
Publisher
ieee
Conference_Titel
Complex, Intelligent and Software Intensive Systems, 2009. CISIS '09. International Conference on
Conference_Location
Fukuoka
Print_ISBN
978-1-4244-3569-2
Electronic_ISBN
978-0-7695-3575-3
Type
conf
DOI
10.1109/CISIS.2009.194
Filename
5066887
Link To Document