DocumentCode :
3106209
Title :
Getting the Most Out of Ensemble Selection
Author :
Caruana, Rich ; Munson, Art ; Niculescu-Mizil, Alexandru
Author_Institution :
Dept. of Comput. Sci., Cornell Univ., Ithaca, NY
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
828
Lastpage :
833
Abstract :
We investigate four previously unexplored aspects of ensemble selection, a procedure for building ensembles of classifiers. First we test whether adjusting model predictions to put them on a canonical scale makes the ensembles more effective. Second, we explore the performance of ensemble selection when different amounts of data are available for ensemble hillclimbing. Third, we quantify the benefit of ensemble selection´s ability to optimize to arbitrary metrics. Fourth, we study the performance impact of pruning the number of models available for ensemble selection. Based on our results we present improved ensemble selection methods that double the benefit of the original method.
Keywords :
learning (artificial intelligence); pattern classification; arbitrary metrics; canonical scale; ensemble selection; model predictions; Art; Calibration; Computer science; Natural language processing; Optimization methods; Particle measurements; Predictive models; Testing; Training data; Wrapping;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.76
Filename :
4053111
Link To Document :
بازگشت