DocumentCode
671401
Title
Crogging (cross-validation aggregation) for forecasting — A novel algorithm of neural network ensembles on time series subsamples
Author
Barrow, Devon K. ; Crone, Sven F.
Author_Institution
Lancaster Centre for Forecasting, Lancaster Univ., Lancaster, UK
fYear
2013
fDate
4-9 Aug. 2013
Firstpage
1
Lastpage
8
Abstract
In classification, regression and time series prediction alike, cross-validation is widely employed to estimate the expected accuracy of a predictive algorithm by averaging predictive errors across mutually exclusive subsamples of the data. Similarly, bootstrapping aims to increase the validity of estimating the expected accuracy by repeatedly sub-sampling the data with replacement, creating overlapping samples of the data. Estimates are then used to anticipate of future risk in decision making, or to guide model selection where multiple candidates are feasible. Beyond error estimation, bootstrapping has recently been extended to combine each of the diverse models created for estimation, and aggregating over each of their predictions (rather than their errors), coined bootstrap aggregation or bagging. However, similar extensions of cross-validation to create diverse forecasting models have not been considered. In accordance with bagging, we propose to combine the benefits of cross-validation and forecast aggregation, i.e. crogging. We assesses different levels of cross-validation, including a (single-fold) hold-out approach, 2-fold and 10-fold cross validation and Monte-Carlos cross validation, to create diverse base-models of neural networks for time series prediction trained on different data subsets, and average their individual multiple-step ahead predictions. Results of forecasting the 111 time series of the NN3 competition indicate significant improvements accuracy through Crogging relative to Bagging or individual model selection of neural networks.
Keywords
Monte Carlo methods; decision making; forecasting theory; neural nets; pattern classification; regression analysis; time series; 10-fold cross validation; 2-fold cross validation; Monte-Carlo cross validation; bootstrap aggregation; bootstrap bagging; classification; crogging; cross-validation aggregation; data subsets; decision making; error estimation; expected accuracy estimation; forecast aggregation; forecasting model; individual model selection; model selection; neural network base models; neural network ensembles; predictive algorithm; predictive errors; regression analysis; single-fold hold-out approach; time series prediction; time series subsamples; Accuracy; Artificial neural networks; Bagging; Forecasting; Predictive models; Time series analysis; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks (IJCNN), The 2013 International Joint Conference on
Conference_Location
Dallas, TX
ISSN
2161-4393
Print_ISBN
978-1-4673-6128-6
Type
conf
DOI
10.1109/IJCNN.2013.6706740
Filename
6706740
Link To Document