DocumentCode :
2731926
Title :
Balanced accuracy for feature subset selection with genetic algorithms
Author :
Peterson, Michael R. ; Raymer, Michael L. ; Lamont, Gary B.
Author_Institution :
Dept. of CS & Eng., Wright State Univ., Dayton, OH, USA
Volume :
3
fYear :
2005
fDate :
2-5 Sept. 2005
Firstpage :
2506
Abstract :
The relevance of a set of measured features describing labeled patterns within a problem domain affects classifier performance. Feature subset selection algorithms employing a wrapper approach typically assess the fitness of a feature subset simply as the accuracy of a given classifier over a set of available patterns using the candidate feature set. For datasets with many patterns for some classes and few for others, relatively high accuracy may be achieved simply by labeling unknown patterns according to the largest class. Feature selection wrappers that only emphasize high accuracy typically follow this bias. Class bias may be mitigated by emphasizing well-balanced accuracy during the optimization algorithm. This paper proposes adding selective pressure for balanced accuracy to mitigate class bias during feature set evolution. Experiments compare the selection performance of genetic algorithms using various fitness functions varying in terms of accuracy, balance, and feature parsimony. Several feature selection algorithms including greedy, genetic, filter, and hybrid filter/GA approaches are then compared using the best fitness function. The experiments employ a naive Bayes classifier and public domain datasets. The results suggest that improvements to class balance and feature subset size can be made without compromising overall accuracy or run-time efficiency.
Keywords :
Bayes methods; feature extraction; genetic algorithms; learning (artificial intelligence); pattern clustering; balanced accuracy; feature subset selection; genetic algorithms; labeled patterns; optimization; Filters; Force measurement; Genetic algorithms; Labeling; Machine learning; Military computing; Neural networks; Pattern recognition; Runtime; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2005. The 2005 IEEE Congress on
Print_ISBN :
0-7803-9363-5
Type :
conf
DOI :
10.1109/CEC.2005.1555008
Filename :
1555008
Link To Document :
بازگشت