Title of article :
Out-of-bag estimation of the optimal sample size in bagging
Author/Authors :
ءngel and Martيnez-Muٌoz، نويسنده , , Gonzalo and Suلrez، نويسنده , , Alberto، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2010
Pages :
10
From page :
143
To page :
152
Abstract :
The performance of m -out-of- n bagging with and without replacement in terms of the sampling ratio ( m / n ) is analyzed. Standard bagging uses resampling with replacement to generate bootstrap samples of equal size as the original training set m wor = n . Without-replacement methods typically use half samples m wr = n / 2 . These choices of sampling sizes are arbitrary and need not be optimal in terms of the classification performance of the ensemble. We propose to use the out-of-bag estimates of the generalization accuracy to select a near-optimal value for the sampling ratio. Ensembles of classifiers trained on independent samples whose size is such that the out-of-bag error of the ensemble is as low as possible generally improve the performance of standard bagging and can be efficiently built.
Keywords :
Bagging , Subagging , Optimal sampling ratio , Bootstrap sampling , decision trees , Subsampling , Ensembles of classifiers
Journal title :
PATTERN RECOGNITION
Serial Year :
2010
Journal title :
PATTERN RECOGNITION
Record number :
1733090
Link To Document :
بازگشت