DocumentCode :
3154138
Title :
A noise-based stability evaluation of threshold-based feature selection techniques
Author :
Altidor, Wilker ; Khoshgoftaar, Taghi M. ; Napolitano, Amri
Author_Institution :
Florida Atlantic Univ., Boca Raton, FL, USA
fYear :
2011
fDate :
3-5 Aug. 2011
Firstpage :
240
Lastpage :
245
Abstract :
This paper presents a noise-based stability performance evaluation approach for feature selection techniques. For the stability assessment, a similarity-based measure is used to quantify the degree of agreement between a filter´s output on a clean dataset and its outputs on the same dataset corrupted with different combinations of noise level and noise distribution. Experiments are conducted with 11 threshold-based feature selection techniques on six different real-world datasets. The experimental results show that some filters perform much better than others in terms of their insensitivity to noise. The results also show an interesting relationship between the size of the training data and the stability performance of a filter; the stability performance of a filter tends to improve when learning from large size datasets.
Keywords :
feature extraction; learning (artificial intelligence); pattern classification; statistical analysis; very large databases; clean dataset; large size datasets; learning; noise-based stability evaluation; stability assessment; threshold-based feature selection techniques; Cancer; Indexes; Lungs; Noise; Noise measurement; Stability criteria; class noise; kuncheva index; large size datasets; stability; threshold-based feature selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Reuse and Integration (IRI), 2011 IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4577-0964-7
Electronic_ISBN :
978-1-4577-0965-4
Type :
conf
DOI :
10.1109/IRI.2011.6009553
Filename :
6009553
Link To Document :
بازگشت