• DocumentCode
    3046347
  • Title

    Feature Selection for Cancer Classification Based on Support Vector Machine

  • Author

    Luo, Wei ; Wang, Lipo ; Sun, Jingjing

  • Author_Institution
    Coll. of Inf. Eng., Xiangtan Univ., Xiangtan, China
  • Volume
    4
  • fYear
    2009
  • fDate
    19-21 May 2009
  • Firstpage
    422
  • Lastpage
    426
  • Abstract
    Feature selection plays an important role in cancer classification, for gene expression data usually have a large number of dimensions and relatively a small number of samples. In this paper, we use the support vector machine (SVM) for cancer classification. We propose a mixed two-step feature selection method. The first step uses a modified t-test method to select discriminatory features. The second step extracts principal components from the top-ranked genes based on the modified t-test method. We tested our two-step method in three data sets, i.e., the lymphoma data set, the SRBCT data set, and the ovarian cancer data set. The results in all the three data sets show our two-step methods is able to achieve 100% accuracy with much fewer genes than other published results.
  • Keywords
    cancer; feature extraction; genetics; medical computing; pattern classification; principal component analysis; support vector machines; tumours; SVM; cancer classification; feature selection; gene expression data; principal component extraction; support vector machine; t-test method; top-ranked gene; Cancer; Classification algorithms; Data engineering; Educational institutions; Gene expression; Intelligent systems; Machine intelligence; Sun; Support vector machine classification; Support vector machines; SVM; cancer classification; gene expression data; principal component analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2009. GCIS '09. WRI Global Congress on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-0-7695-3571-5
  • Type

    conf

  • DOI
    10.1109/GCIS.2009.45
  • Filename
    5209263