Title :
A Multiple-Filter-Multiple-Wrapper Approach to Gene Selection and Microarray Data Classification
Author :
Leung, Yukyee ; Hung, Yeungsam
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Hong Kong, Hong Kong, China
Abstract :
Filters and wrappers are two prevailing approaches for gene selection in microarray data analysis. Filters make use of statistical properties of each gene to represent its discriminating power between different classes. The computation is fast but the predictions are inaccurate. Wrappers make use of a chosen classifier to select genes by maximizing classification accuracy, but the computation burden is formidable. Filters and wrappers have been combined in previous studies to maximize the classification accuracy for a chosen classifier with respect to a filtered set of genes. The drawback of this single-filter-single-wrapper (SFSW) approach is that the classification accuracy is dependent on the choice of specific filter and wrapper. In this paper, a multiple-filter-multiple-wrapper (MFMW) approach is proposed that makes use of multiple filters and multiple wrappers to improve the accuracy and robustness of the classification, and to identify potential biomarker genes. Experiments based on six benchmark data sets show that the MFMW approach outperforms SFSW models (generated by all combinations of filters and wrappers used in the corresponding MFMW model) in all cases and for all six data sets. Some of MFMW-selected genes have been confirmed to be biomarkers or contribute to the development of particular cancers by other studies.
Keywords :
cancer; classification; genetics; medical diagnostic computing; biomarker genes; cancers; gene selection; microarray data classification; multiple filter multiple wrapper approach; Classifier design and evaluation; Feature evaluation and selection; Filters; gene selection; hybrid classification models; microarray data classification; wrappers.; Algorithms; Gene Expression Profiling; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2008.46