DocumentCode :
2983211
Title :
Feature Weighting and Selection Using Hypothesis Margin of Boosting
Author :
Alshawabkeh, M. ; Aslam, Javed A. ; Dy, Jennifer G. ; Kaeli, David
Author_Institution :
Dept. of Electr. & Comput. Eng., Northeastern Univ., Boston, MA, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
41
Lastpage :
50
Abstract :
Using hypothesis margins to measure the quality of a set of features has been a growing line of research over the last decade. However, most previous algorithms, such as Simba, have been developed under the large hypothesis margin principle of the 1-NN algorithm. Little attention has so far been paid to exploiting the hypothesis margins of boosting to evaluate features. Boosting is well known to maximize the training examples' hypothesis margins, in particular the average margin, which is known to be the first statistic that considers the whole margin distribution. In this paper, we describe how to use the training examples' mean margins of boosting to select features. A weight criterion, termed Margin Fraction (MF), is assigned to each feature that contributes to the average margin distribution combined in the final output produced by boosting. Applying the idea of MF to a sequential backward selection method, a new embedded selection algorithm, called SBS-MF, is proposed. Experiments on different data sets compare the proposed SBS-MF with two boosting-based feature selection approaches, as well as with Simba. The results show that SBS-MF is effective in most cases.
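The abstract does not give the exact definition of MF, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes AdaBoost over single-feature decision stumps, and it reads a feature's weight as the share of the normalized average training margin contributed by the stumps that split on that feature. The backward-elimination loop, which drops the lowest-weighted feature each pass, is likewise an assumed reading of "sequential backward selection". All function names (`stump_predict`, `train_stump`, `adaboost`, `margin_fractions`, `average_margin`, `sbs_mf`) and the toy data are hypothetical.

```python
import math


def stump_predict(row, feature, thr, polarity):
    """Decision stump: +polarity if the feature exceeds thr, else -polarity."""
    return polarity if row[feature] > thr else -polarity


def train_stump(X, y, w):
    """Exhaustively pick the stump with the lowest weighted training error."""
    best = None
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        # Candidate thresholds: one below the minimum, then midpoints.
        thresholds = [values[0] - 1.0]
        thresholds += [(a + b) / 2 for a, b in zip(values, values[1:])]
        for thr in thresholds:
            for pol in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(row, j, thr, pol) != yi)
                if best is None or err < best[0]:
                    best = (err, j, thr, pol)
    return best


def adaboost(X, y, rounds=10):
    """Plain AdaBoost; returns a list of (alpha, feature, thr, polarity)."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        err, j, thr, pol = train_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)  # keep the log finite
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, j, thr, pol))
        w = [wi * math.exp(-alpha * yi * stump_predict(row, j, thr, pol))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble


def margin_fractions(X, y, ensemble):
    """Assumed MF: each feature's share of the normalized average margin."""
    mf = [0.0] * len(X[0])
    alpha_sum = sum(abs(a) for a, _, _, _ in ensemble)
    if alpha_sum == 0:
        return mf
    for alpha, j, thr, pol in ensemble:
        agreement = sum(yi * stump_predict(row, j, thr, pol)
                        for row, yi in zip(X, y)) / len(X)
        mf[j] += alpha * agreement / alpha_sum
    return mf


def average_margin(X, y, ensemble):
    """Mean normalized margin y * f(x) / sum(|alpha|) over the training set."""
    alpha_sum = sum(abs(a) for a, _, _, _ in ensemble)
    total = 0.0
    for row, yi in zip(X, y):
        score = sum(a * stump_predict(row, j, thr, pol)
                    for a, j, thr, pol in ensemble)
        total += yi * score / alpha_sum
    return total / len(X)


def sbs_mf(X, y, n_keep, rounds=10):
    """Assumed backward selection: retrain, drop the lowest-MF feature, repeat."""
    active = list(range(len(X[0])))
    while len(active) > n_keep:
        X_sub = [[row[j] for j in active] for row in X]
        mf = margin_fractions(X_sub, y, adaboost(X_sub, y, rounds))
        worst = min(range(len(active)), key=lambda k: mf[k])
        del active[worst]
    return active


# Hypothetical toy data: feature 0 separates the classes, feature 1 does not.
X = [[1, 5], [2, 3], [3, 8], [4, 1], [6, 4], [7, 9], [8, 2], [9, 7]]
y = [-1, -1, -1, -1, 1, 1, 1, 1]
ensemble = adaboost(X, y)
weights = margin_fractions(X, y, ensemble)
selected = sbs_mf(X, y, n_keep=1)
```

Under this reading the per-feature weights sum to the average training margin by construction, which is one way to interpret a "fraction" of the margin; the paper itself may normalize or define the weights differently.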
Keywords :
learning (artificial intelligence); statistics; 1-NN algorithm; MF; SBS-MF; Simba; average margin distribution; boosting based feature selection approach; boosting hypothesis margin; embedded selection algorithm; feature weighting; margin distribution; margin fraction; sequential backward selection method; statistics; training example mean margins; weight criterion; Additives; Boosting; Computers; Educational institutions; Predictive models; Training; Weight measurement; Feature selection; average margin; boosting
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
ISSN :
1550-4786
Print_ISBN :
978-1-4673-4649-8
Type :
conf
DOI :
10.1109/ICDM.2012.143
Filename :
6413786