• DocumentCode
    1452161
  • Title

    Two-Step Cross-Entropy Feature Selection for Microarrays—Power Through Complementarity

  • Author

    Peters, Tim ; Bulger, David W. ; Loi, To-Ha ; Yang, Jean Yee Hwa ; Ma, David

  • Author_Institution
    Dept. of Stat., Macquarie Univ., Sydney, NSW, Australia
  • Volume
    8
  • Issue
    4
  • fYear
    2011
  • Firstpage
    1148
  • Lastpage
    1151
  • Abstract
    Current feature selection methods for supervised classification of tissue samples from microarray data generally fail to exploit complementary discriminatory power that can be found in sets of features. Using a feature selection method with the computational architecture of the cross-entropy method, including an additional preliminary step ensuring a lower bound on the number of times any feature is considered, we show when testing on a human lymph node data set that there are a significant number of genes that perform well when their complementary power is assessed, but "pass under the radar” of popular feature selection methods that only assess genes individually on a given classification tool. We also show that this phenomenon becomes more apparent as diagnostic specificity of the tissue samples analysed increases.
  • Keywords
    bioinformatics; biological tissues; data mining; feature extraction; genetics; complementary discriminatory power; diagnostic specificity; genes; human lymph node data set; microarray data; supervised classification; tissue samples; two-step cross-entropy feature selection; Australia; Bioinformatics; Biology; Cancer; Robustness; Smoothing methods; Training; Feature selection; data mining; genetic interdependence; lymphoma.; microarray; Algorithms; Artificial Intelligence; Computational Biology; Data Mining; Databases, Genetic; Humans; Lymph Nodes; Lymphoma, B-Cell; Oligonucleotide Array Sequence Analysis;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2011.30
  • Filename
    5714685