Title :
Two-Step Cross-Entropy Feature Selection for Microarrays—Power Through Complementarity
Author :
Peters, Tim ; Bulger, David W. ; Loi, To-Ha ; Yang, Jean Yee Hwa ; Ma, David
Author_Institution :
Dept. of Stat., Macquarie Univ., Sydney, NSW, Australia
Abstract :
Current feature selection methods for supervised classification of tissue samples from microarray data generally fail to exploit complementary discriminatory power that can be found in sets of features. Using a feature selection method with the computational architecture of the cross-entropy method, including an additional preliminary step ensuring a lower bound on the number of times any feature is considered, we show when testing on a human lymph node data set that there are a significant number of genes that perform well when their complementary power is assessed, but "pass under the radar” of popular feature selection methods that only assess genes individually on a given classification tool. We also show that this phenomenon becomes more apparent as diagnostic specificity of the tissue samples analysed increases.
Keywords :
bioinformatics; biological tissues; data mining; feature extraction; genetics; complementary discriminatory power; diagnostic specificity; genes; human lymph node data set; microarray data; supervised classification; tissue samples; two-step cross-entropy feature selection; Australia; Bioinformatics; Biology; Cancer; Robustness; Smoothing methods; Training; Feature selection; data mining; genetic interdependence; lymphoma.; microarray; Algorithms; Artificial Intelligence; Computational Biology; Data Mining; Databases, Genetic; Humans; Lymph Nodes; Lymphoma, B-Cell; Oligonucleotide Array Sequence Analysis;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2011.30