DocumentCode
1240239
Title
An experimental bias-variance analysis of SVM ensembles based on resampling techniques
Author
Valentini, Giorgio
Author_Institution
DSI-Dipt. di Sci. dell´´Informazione, Univ. degli Studi di Milano, Italy
Volume
35
Issue
6
fYear
2005
Firstpage
1252
Lastpage
1271
Abstract
Recently, bias-variance decomposition of error has been used as a tool to study the behavior of learning algorithms and to develop new ensemble methods well suited to the bias-variance characteristics of base learners. We propose methods and procedures, based on Domingo´s unified bias-variance theory, to evaluate and quantitatively measure the bias-variance decomposition of error in ensembles of learning machines. We apply these methods to study and compare the bias-variance characteristics of single support vector machines (SVMs) and ensembles of SVMs based on resampling techniques, and their relationships with the cardinality of the training samples. In particular, we present an experimental bias-variance analysis of bagged and random aggregated ensembles of SVMs in order to verify their theoretical variance reduction properties. The experimental bias-variance analysis quantitatively characterizes the relationships between bagging and random aggregating, and explains the reasons why ensembles built on small subsamples of the data work with large databases. Our analysis also suggests new directions for research to improve on classical bagging.
Keywords
covariance analysis; sampling methods; support vector machines; SVM ensembles; bias-variance analysis; large databases; learning algorithm; random aggregating; resampling techniques; support vector machines; Analysis of variance; Bagging; Data analysis; Databases; Decision trees; Kernel; Machine learning; Statistical analysis; Stochastic processes; Support vector machines; Bagging; bias-variance analysis; ensemble of learning machines; support vector machines; Algorithms; Analysis of Variance; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Models, Statistical; Pattern Recognition, Automated; Signal Processing, Computer-Assisted;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/TSMCB.2005.850183
Filename
1542270
Link To Document