DocumentCode :
2516529
Title :
Filter-wrapper approach to feature selection using RST-DPSO for mining protein function
Author :
Rahman, Shah Atiqur ; Bakar, Afarulrazi Abu ; Hussein, Zeti Azura Mohamed
Author_Institution :
Fac. of Inf. Sci. & Technol., Univ. Kebangsaan Malaysia, Bangi, Malaysia
fYear :
2009
fDate :
27-28 Oct. 2009
Firstpage :
71
Lastpage :
78
Abstract :
This paper proposed a feature selection strategy based on rough set theory (RST) and discrete particle swarm optimization (DPSO) methods prior to classify protein function. RST is adopted in the first phase with the aim to eliminate the insignificant features and prepared the reduce features to the next phase. In the second phase, the reduced features are optimized using the new evolutionary computation method, DPSO. The optimum features from this two methods were mined using support vector machine classifier with the optimum RBF´s kernel parameters. These methods have greatly reduced the features and achieved higher classification accuracy across the selected datasets compared to full features and RST alone. The results also demonstrate that the integration of RST and DPSO is capable of searching the optimal features for protein classification and applicable to different classification problem.
Keywords :
bioinformatics; data mining; evolutionary computation; feature extraction; learning (artificial intelligence); particle swarm optimisation; pattern classification; proteins; radial basis function networks; rough set theory; sequences; support vector machines; RST-DPSO; data mining protein function classification; discrete particle swarm optimization; evolutionary computation method; feature selection strategy; filter-wrapper approach; machine learning; optimum RBF kernel parameter; protein sequence; radial basis function network; rough set theory; support vector machine classifier; Amino acids; Bioinformatics; Data mining; Feature extraction; Frequency; Optimization methods; Particle swarm optimization; Protein sequence; Support vector machine classification; Support vector machines; Data Mining; Feature Selection; Machine Learning; Optimization; Particle Swarm Optimization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining and Optimization, 2009. DMO '09. 2nd Conference on
Conference_Location :
Kajand
Print_ISBN :
978-1-4244-4944-6
Type :
conf
DOI :
10.1109/DMO.2009.5341906
Filename :
5341906
Link To Document :
بازگشت