• DocumentCode
    2311850
  • Title

    Comparison of feature ranking methods based on information entropy

  • Author

    Duch, Wlodzislaw ; Wieczorek, Tadeusz ; Biesiada, Jacek ; Blachnik, Marcin

  • Author_Institution
    Dept. of Inf., Nicholas Copernicus Univ., Torun, Poland
  • Volume
    2
  • fYear
    2004
  • fDate
    25-29 July 2004
  • Firstpage
    1415
  • Abstract
    A comparison between five feature ranking methods based on entropy is presented on artificial and real datasets. Feature ranking method using χ2 statistics gives results that are very similar to the entropy-based methods. The quality of feature rankings obtained by these methods is evaluated using the decision tree and the nearest neighbor classifier with growing number of most important features. Significant differences are found in some cases, but there is no single best index that works best for all data and all classifiers. Therefore to be sure that a subset of features giving highest accuracy has been selected requires the use of many different indices.
  • Keywords
    decision trees; entropy; feature extraction; pattern classification; set theory; statistics; χ2 statistics; artificial datasets; decision trees; feature ranking methods; information entropy based methods; nearest neighbor classifier; real datasets; subsets; Bioinformatics; Classification tree analysis; Decision trees; Feature extraction; Filters; Informatics; Information entropy; Nearest neighbor searches; Statistics; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
  • ISSN
    1098-7576
  • Print_ISBN
    0-7803-8359-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2004.1380157
  • Filename
    1380157