• Title of article

    Out-of-bag estimation of the optimal sample size in bagging

  • Author/Authors

    ءngel and Martيnez-Muٌoz، نويسنده , , Gonzalo and Suلrez، نويسنده , , Alberto، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2010
  • Pages
    10
  • From page
    143
  • To page
    152
  • Abstract
    The performance of m -out-of- n bagging with and without replacement in terms of the sampling ratio ( m / n ) is analyzed. Standard bagging uses resampling with replacement to generate bootstrap samples of equal size as the original training set m wor = n . Without-replacement methods typically use half samples m wr = n / 2 . These choices of sampling sizes are arbitrary and need not be optimal in terms of the classification performance of the ensemble. We propose to use the out-of-bag estimates of the generalization accuracy to select a near-optimal value for the sampling ratio. Ensembles of classifiers trained on independent samples whose size is such that the out-of-bag error of the ensemble is as low as possible generally improve the performance of standard bagging and can be efficiently built.
  • Keywords
    Bagging , Subagging , Optimal sampling ratio , Bootstrap sampling , decision trees , Subsampling , Ensembles of classifiers
  • Journal title
    PATTERN RECOGNITION
  • Serial Year
    2010
  • Journal title
    PATTERN RECOGNITION
  • Record number

    1733090