DocumentCode :
1863254
Title :
A hybrid approach for selecting gene subsets using gene expression data
Author :
Yang, Chang-san ; Chuang, Li-Yeh ; Ke, Chao-hsuan ; Yang, Cheng-Hong
Author_Institution :
Inst. of Biomed. Eng., Tainan
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
159
Lastpage :
164
Abstract :
Many previous research papers have demonstrated that microarray gene expression data are useful for disease classification and medical diagnosis. Cancer microarray data normally have a particular characteristic where features (genes) greatly exceed the instance (tissue sample) numbers. Selecting appropriate numbers and relevant features to differentiate different types of cancer remains a challenge in bioinformatics. In order to select useful gene sets from microarray data to promote classification performance effectively, feature selection approaches were included in many previous literature reports. In this paper, a hybrid approach which combines correlation-based feature selection and binary particle swarm optimization was used to select few subsets, combined with the K-nearest neighbor method as a classifier to evaluate the classification performance. The proposed approach is applied on six microarray gene expression data sets that relate to human cancer. The experimental results show that the proposed approach selects a smaller number of feature subsets and obtains better classification accuracy.
Keywords :
bioinformatics; cancer; feature extraction; genetics; medical diagnostic computing; particle swarm optimisation; pattern classification; tumours; K-nearest neighbor method; binary particle swarm optimization; bioinformatics; cancer microarray data; correlation-based feature selection; disease classification; gene subset selection; medical diagnosis; microarray gene expression data; Computer applications; Computer industry; Gene expression; Feature selection; Microarray gene expression data; binary particle swarm optimization; correlation-based feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Soft Computing in Industrial Applications, 2008. SMCia '08. IEEE Conference on
Conference_Location :
Muroran
Print_ISBN :
978-1-4244-3782-5
Electronic_ISBN :
978-4-9904-2590-6
Type :
conf
DOI :
10.1109/SMCIA.2008.5045953
Filename :
5045953
Link To Document :
بازگشت