• DocumentCode
    1240239
  • Title

    An experimental bias-variance analysis of SVM ensembles based on resampling techniques

  • Author

    Valentini, Giorgio

  • Author_Institution
    DSI-Dipt. di Sci. dell´´Informazione, Univ. degli Studi di Milano, Italy
  • Volume
    35
  • Issue
    6
  • fYear
    2005
  • Firstpage
    1252
  • Lastpage
    1271
  • Abstract
    Recently, bias-variance decomposition of error has been used as a tool to study the behavior of learning algorithms and to develop new ensemble methods well suited to the bias-variance characteristics of base learners. We propose methods and procedures, based on Domingo´s unified bias-variance theory, to evaluate and quantitatively measure the bias-variance decomposition of error in ensembles of learning machines. We apply these methods to study and compare the bias-variance characteristics of single support vector machines (SVMs) and ensembles of SVMs based on resampling techniques, and their relationships with the cardinality of the training samples. In particular, we present an experimental bias-variance analysis of bagged and random aggregated ensembles of SVMs in order to verify their theoretical variance reduction properties. The experimental bias-variance analysis quantitatively characterizes the relationships between bagging and random aggregating, and explains the reasons why ensembles built on small subsamples of the data work with large databases. Our analysis also suggests new directions for research to improve on classical bagging.
  • Keywords
    covariance analysis; sampling methods; support vector machines; SVM ensembles; bias-variance analysis; large databases; learning algorithm; random aggregating; resampling techniques; support vector machines; Analysis of variance; Bagging; Data analysis; Databases; Decision trees; Kernel; Machine learning; Statistical analysis; Stochastic processes; Support vector machines; Bagging; bias-variance analysis; ensemble of learning machines; support vector machines; Algorithms; Analysis of Variance; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Models, Statistical; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2005.850183
  • Filename
    1542270