DocumentCode
2311850
Title
Comparison of feature ranking methods based on information entropy
Author
Duch, Wlodzislaw ; Wieczorek, Tadeusz ; Biesiada, Jacek ; Blachnik, Marcin
Author_Institution
Dept. of Inf., Nicholas Copernicus Univ., Torun, Poland
Volume
2
fYear
2004
fDate
25-29 July 2004
Firstpage
1415
Abstract
A comparison between five feature ranking methods based on entropy is presented on artificial and real datasets. Feature ranking method using χ2 statistics gives results that are very similar to the entropy-based methods. The quality of feature rankings obtained by these methods is evaluated using the decision tree and the nearest neighbor classifier with growing number of most important features. Significant differences are found in some cases, but there is no single best index that works best for all data and all classifiers. Therefore to be sure that a subset of features giving highest accuracy has been selected requires the use of many different indices.
Keywords
decision trees; entropy; feature extraction; pattern classification; set theory; statistics; χ2 statistics; artificial datasets; decision trees; feature ranking methods; information entropy based methods; nearest neighbor classifier; real datasets; subsets; Bioinformatics; Classification tree analysis; Decision trees; Feature extraction; Filters; Informatics; Information entropy; Nearest neighbor searches; Statistics; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on
ISSN
1098-7576
Print_ISBN
0-7803-8359-1
Type
conf
DOI
10.1109/IJCNN.2004.1380157
Filename
1380157
Link To Document