DocumentCode :
1115030
Title :
An Analysis of Ensemble Pruning Techniques Based on Ordered Aggregation
Author :
Martinez-Muoz, G. ; Hernandez-Lobato, Daniel ; Suarez, Almudena
Author_Institution :
Comput. Sci. Dept., Univ. Autonoma de Madrid, Cantoblanco
Volume :
31
Issue :
2
fYear :
2009
Firstpage :
245
Lastpage :
259
Abstract :
Several pruning strategies that can be used to reduce the size and increase the accuracy of bagging ensembles are analyzed. These heuristics select subsets of complementary classifiers that, when combined, can perform better than the whole ensemble. The pruning methods investigated are based on modifying the order of aggregation of classifiers in the ensemble. In the original bagging algorithm, the order of aggregation is left unspecified. When this order is random, the generalization error typically decreases as the number of classifiers in the ensemble increases. If an appropriate ordering for the aggregation process is devised, the generalization error reaches a minimum at intermediate numbers of classifiers. This minimum lies below the asymptotic error of bagging. Pruned ensembles are obtained by retaining a fraction of the classifiers in the ordered ensemble. The performance of these pruned ensembles is evaluated in several benchmark classification tasks under different training conditions. The results of this empirical investigation show that ordered aggregation can be used for the efficient generation of pruned ensembles that are competitive, in terms of performance and robustness of classification, with computationally more costly methods that directly select optimal or near-optimal subensembles.
Keywords :
aggregation; bagging; decision trees; learning (artificial intelligence); set theory; bagging algorithm; bagging ensembles; ensemble pruning techniques; heuristics select subsets; near-optimal subensembles; ordered aggregation; Bagging; Boosting; Classification tree analysis; Decision trees; Fluctuations; Noise robustness; Sampling methods; Size measurement; Training data; Bagging; Decision trees; Ensemble Pruning; Ensemble Selection; Ensembles of classifiers; Ordered Aggregation; bagging; decision trees; ensemble pruning; ensemble selection; ordered aggregation.; Algorithms; Artificial Intelligence; Computer Simulation; Decision Support Techniques; Models, Theoretical; Pattern Recognition, Automated;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2008.78
Filename :
4479485
Link To Document :
بازگشت