DocumentCode
3046347
Title
Feature Selection for Cancer Classification Based on Support Vector Machine
Author
Luo, Wei ; Wang, Lipo ; Sun, Jingjing
Author_Institution
Coll. of Inf. Eng., Xiangtan Univ., Xiangtan, China
Volume
4
fYear
2009
fDate
19-21 May 2009
Firstpage
422
Lastpage
426
Abstract
Feature selection plays an important role in cancer classification, for gene expression data usually have a large number of dimensions and relatively a small number of samples. In this paper, we use the support vector machine (SVM) for cancer classification. We propose a mixed two-step feature selection method. The first step uses a modified t-test method to select discriminatory features. The second step extracts principal components from the top-ranked genes based on the modified t-test method. We tested our two-step method in three data sets, i.e., the lymphoma data set, the SRBCT data set, and the ovarian cancer data set. The results in all the three data sets show our two-step methods is able to achieve 100% accuracy with much fewer genes than other published results.
Keywords
cancer; feature extraction; genetics; medical computing; pattern classification; principal component analysis; support vector machines; tumours; SVM; cancer classification; feature selection; gene expression data; principal component extraction; support vector machine; t-test method; top-ranked gene; Cancer; Classification algorithms; Data engineering; Educational institutions; Gene expression; Intelligent systems; Machine intelligence; Sun; Support vector machine classification; Support vector machines; SVM; cancer classification; gene expression data; principal component analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
Conference_Location
Xiamen
Print_ISBN
978-0-7695-3571-5
Type
conf
DOI
10.1109/GCIS.2009.45
Filename
5209263
Link To Document