• DocumentCode
    1219989
  • Title

    A Multiple-Filter-Multiple-Wrapper Approach to Gene Selection and Microarray Data Classification

  • Author

    Leung, Yukyee ; Hung, Yeungsam

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Univ. of Hong Kong, Hong Kong, China
  • Volume
    7
  • Issue
    1
  • fYear
    2010
  • Firstpage
    108
  • Lastpage
    117
  • Abstract
    Filters and wrappers are two prevailing approaches for gene selection in microarray data analysis. Filters make use of statistical properties of each gene to represent its discriminating power between different classes. The computation is fast but the predictions are inaccurate. Wrappers make use of a chosen classifier to select genes by maximizing classification accuracy, but the computation burden is formidable. Filters and wrappers have been combined in previous studies to maximize the classification accuracy for a chosen classifier with respect to a filtered set of genes. The drawback of this single-filter-single-wrapper (SFSW) approach is that the classification accuracy is dependent on the choice of specific filter and wrapper. In this paper, a multiple-filter-multiple-wrapper (MFMW) approach is proposed that makes use of multiple filters and multiple wrappers to improve the accuracy and robustness of the classification, and to identify potential biomarker genes. Experiments based on six benchmark data sets show that the MFMW approach outperforms SFSW models (generated by all combinations of filters and wrappers used in the corresponding MFMW model) in all cases and for all six data sets. Some of MFMW-selected genes have been confirmed to be biomarkers or contribute to the development of particular cancers by other studies.
  • Keywords
    cancer; classification; genetics; medical diagnostic computing; biomarker genes; cancers; gene selection; microarray data classification; multiple filter multiple wrapper approach; Classifier design and evaluation; Feature evaluation and selection; Filters; gene selection; hybrid classification models; microarray data classification; wrappers.; Algorithms; Gene Expression Profiling; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2008.46
  • Filename
    4522538