DocumentCode
1452161
Title
Two-Step Cross-Entropy Feature Selection for Microarrays—Power Through Complementarity
Author
Peters, Tim ; Bulger, David W. ; Loi, To-Ha ; Yang, Jean Yee Hwa ; Ma, David
Author_Institution
Dept. of Stat., Macquarie Univ., Sydney, NSW, Australia
Volume
8
Issue
4
fYear
2011
Firstpage
1148
Lastpage
1151
Abstract
Current feature selection methods for supervised classification of tissue samples from microarray data generally fail to exploit complementary discriminatory power that can be found in sets of features. Using a feature selection method with the computational architecture of the cross-entropy method, including an additional preliminary step ensuring a lower bound on the number of times any feature is considered, we show when testing on a human lymph node data set that there are a significant number of genes that perform well when their complementary power is assessed, but "pass under the radar” of popular feature selection methods that only assess genes individually on a given classification tool. We also show that this phenomenon becomes more apparent as diagnostic specificity of the tissue samples analysed increases.
Keywords
bioinformatics; biological tissues; data mining; feature extraction; genetics; complementary discriminatory power; diagnostic specificity; genes; human lymph node data set; microarray data; supervised classification; tissue samples; two-step cross-entropy feature selection; Australia; Bioinformatics; Biology; Cancer; Robustness; Smoothing methods; Training; Feature selection; data mining; genetic interdependence; lymphoma.; microarray; Algorithms; Artificial Intelligence; Computational Biology; Data Mining; Databases, Genetic; Humans; Lymph Nodes; Lymphoma, B-Cell; Oligonucleotide Array Sequence Analysis;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2011.30
Filename
5714685
Link To Document