DocumentCode :
990016
Title :
Fast branch & bound algorithms for optimal feature selection
Author :
Somol, Petr ; Pudil, Pavel ; Kittler, Josef
Volume :
26
Issue :
7
Year :
2004
Date :
July 1, 2004
Firstpage :
900
Lastpage :
912
Abstract :
A novel search principle for optimal feature subset selection using the branch & bound method is introduced. Thanks to a simple mechanism for predicting criterion values, a considerable amount of time can be saved by avoiding many slow criterion evaluations. We propose two implementations of the prediction mechanism that are suitable for use with nonrecursive and recursive criterion forms, respectively. Both algorithms usually find the optimum several times faster than any other known branch & bound algorithm. Because computational efficiency is crucial, given the exponential nature of the search problem, we also investigate other factors that affect the search performance of all branch & bound algorithms. Using a set of synthetic criteria, we show that the speed of the branch & bound algorithms strongly depends on the diversity among features, feature stability with respect to different subsets, and criterion function dependence on feature set size. We identify the scenarios where the search is accelerated most dramatically (finishing in linear time), as well as the worst conditions. We verify our conclusions experimentally on three real data sets using traditional probabilistic distance criteria.
Keywords :
computational complexity; feature extraction; pattern recognition; prediction theory; probabilistic logic; recursive functions; set theory; tree searching; trees (mathematics); fast branch and bound algorithm; feature set; feature stability; nonrecursive criterion; optimal feature subset selection; prediction mechanism; probabilistic distance criteria; recursive criterion; search problem; Acceleration; Artificial intelligence; Computational complexity; Computational efficiency; Helium; Performance gain; Search methods; Search problems; Size measurement; Stability criteria; Subset search; artificial intelligence; dimensionality reduction; feature selection; optimum search; search tree; subset selection; Algorithms; Artificial Intelligence; Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated
Language :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
Journal article
DOI :
10.1109/TPAMI.2004.28
Filename :
1300560