DocumentCode :
1219989
Title :
A Multiple-Filter-Multiple-Wrapper Approach to Gene Selection and Microarray Data Classification
Author :
Leung, Yukyee ; Hung, Yeungsam
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Hong Kong, Hong Kong, China
Volume :
7
Issue :
1
fYear :
2010
Firstpage :
108
Lastpage :
117
Abstract :
Filters and wrappers are two prevailing approaches for gene selection in microarray data analysis. Filters make use of statistical properties of each gene to represent its discriminating power between different classes. The computation is fast but the predictions are inaccurate. Wrappers make use of a chosen classifier to select genes by maximizing classification accuracy, but the computational burden is formidable. Filters and wrappers have been combined in previous studies to maximize the classification accuracy for a chosen classifier with respect to a filtered set of genes. The drawback of this single-filter-single-wrapper (SFSW) approach is that the classification accuracy depends on the choice of the specific filter and wrapper. In this paper, a multiple-filter-multiple-wrapper (MFMW) approach is proposed that makes use of multiple filters and multiple wrappers to improve the accuracy and robustness of the classification, and to identify potential biomarker genes. Experiments based on six benchmark data sets show that the MFMW approach outperforms SFSW models (generated by all combinations of filters and wrappers used in the corresponding MFMW model) in all cases and for all six data sets. Some of the MFMW-selected genes have been confirmed by other studies to be biomarkers or to contribute to the development of particular cancers.
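The filter/wrapper distinction in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: the abstract does not name the filters, classifiers, or combination rule, so everything below is an assumption chosen for illustration. The two filter scores (`snr_score`, `mean_diff_score`), the two classifiers (`centroid_predict`, `one_nn_predict`), the union pooling of filtered genes, the greedy forward selection, the leave-one-out accuracy estimate, and the toy data are all hypothetical.

```python
from statistics import mean, pstdev

def snr_score(a, b):
    # Filter 1 (assumed): signal-to-noise ratio between the two classes.
    return abs(mean(a) - mean(b)) / (pstdev(a) + pstdev(b) + 1e-9)

def mean_diff_score(a, b):
    # Filter 2 (assumed): absolute difference of the class means.
    return abs(mean(a) - mean(b))

def top_genes(X, y, score, k):
    # Filter step: score each gene (column) independently, keep the k best.
    scores = []
    for g in range(len(X[0])):
        a = [row[g] for row, lab in zip(X, y) if lab == 0]
        b = [row[g] for row, lab in zip(X, y) if lab == 1]
        scores.append((score(a, b), g))
    return [g for _, g in sorted(scores, reverse=True)[:k]]

def centroid_predict(Xtr, ytr, x, genes):
    # Classifier 1 (assumed): nearest class centroid on the chosen genes.
    best = None
    for lab in sorted(set(ytr)):
        rows = [r for r, l in zip(Xtr, ytr) if l == lab]
        d = sum((x[g] - mean(r[g] for r in rows)) ** 2 for g in genes)
        if best is None or d < best[0]:
            best = (d, lab)
    return best[1]

def one_nn_predict(Xtr, ytr, x, genes):
    # Classifier 2 (assumed): 1-nearest-neighbour on the chosen genes.
    return min((sum((x[g] - r[g]) ** 2 for g in genes), l)
               for r, l in zip(Xtr, ytr))[1]

def loo_accuracy(X, y, genes, predict):
    # Leave-one-out accuracy of a classifier restricted to `genes`.
    hits = 0
    for i in range(len(X)):
        Xtr, ytr = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        hits += predict(Xtr, ytr, X[i], genes) == y[i]
    return hits / len(X)

def forward_select(X, y, pool, predict):
    # Wrapper step: greedily add the gene that most improves accuracy.
    chosen, best_acc, improved = [], 0.0, True
    while improved:
        improved = False
        for g in pool:
            if g in chosen:
                continue
            acc = loo_accuracy(X, y, chosen + [g], predict)
            if acc > best_acc:
                best_acc, best_gene, improved = acc, g, True
        if improved:
            chosen.append(best_gene)
    return chosen, best_acc

def mfmw_sketch(X, y, k=2):
    # Pool the genes kept by every filter, then run every wrapper on the pool.
    filters = [snr_score, mean_diff_score]
    wrappers = [centroid_predict, one_nn_predict]
    pool = sorted({g for f in filters for g in top_genes(X, y, f, k)})
    return pool, {w.__name__: forward_select(X, y, pool, w) for w in wrappers}

# Toy data: 6 samples x 4 genes; only gene 0 separates the two classes.
X = [[1.0, 5.0, 3.0, 2.0], [1.2, 4.0, 3.1, 2.5], [0.9, 6.0, 2.9, 1.5],
     [3.0, 5.5, 3.0, 2.2], [3.1, 4.5, 3.2, 1.8], [2.9, 5.0, 2.8, 2.4]]
y = [0, 0, 0, 1, 1, 1]
pool, result = mfmw_sketch(X, y)
```

On this toy input, both filters keep genes 0 and 3, and both wrappers then settle on gene 0 alone. Running several filters and wrappers side by side makes it visible when the selected genes depend on one particular filter/classifier pairing, which is the weakness the abstract attributes to SFSW models.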
Keywords :
cancer; classification; genetics; medical diagnostic computing; biomarker genes; cancers; gene selection; microarray data classification; multiple filter multiple wrapper approach; Classifier design and evaluation; Feature evaluation and selection; Filters; hybrid classification models; wrappers; Algorithms; Gene Expression Profiling; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Signal Processing, Computer-Assisted
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
IEEE
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2008.46
Filename :
4522538