Title :
The Problem of Fragile Feature Subset Preference in Feature Selection Methods and a Proposal of Algorithmic Workaround
Author :
Somol, P. ; Grim, J. ; Pudil, P.
Author_Institution :
Dept. of Pattern Recognition, Inst. of Inf. Theor. & Autom., Prague, Czech Republic
Abstract :
We point out a problem inherent in the optimization scheme of many popular feature selection methods. It follows from the implicit assumption that higher feature selection criterion value always indicates more preferable subset even if the value difference is marginal. This assumption ignores the reliability issues of particular feature preferences, over-fitting and feature acquisition cost. We propose an algorithmic extension applicable to many standard feature selection methods allowing better control over feature subset preference. We show experimentally that the proposed mechanism is capable of reducing the size of selected subsets as well as improving classifier generalization.
Keywords :
learning (artificial intelligence); optimisation; pattern classification; algorithmic workaround proposal; classifier generalization improvement; feature selection criterion value; feature selection methods; fragile feature subset preference; optimization scheme; Accuracy; Economics; Electronic mail; Information theory; Pattern recognition; Search problems; Support vector machines; classification; feature acquisition cost; feature selection; feature weights; machine learning; over-fitting; weighted features;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.1068