Crogging (cross-validation aggregation) for forecasting — A novel algorithm of neural network ensembles on time series subsamples

Author

Barrow, Devon K. ; Crone, Sven F.

Author_Institution

Lancaster Centre for Forecasting, Lancaster Univ., Lancaster, UK

fYear

2013

fDate

4-9 Aug. 2013

Firstpage

1

Lastpage

8

Abstract

In classification, regression and time series prediction alike, cross-validation is widely employed to estimate the expected accuracy of a predictive algorithm by averaging predictive errors across mutually exclusive subsamples of the data. Similarly, bootstrapping aims to increase the validity of estimating the expected accuracy by repeatedly sub-sampling the data with replacement, creating overlapping samples of the data. Estimates are then used to anticipate of future risk in decision making, or to guide model selection where multiple candidates are feasible. Beyond error estimation, bootstrapping has recently been extended to combine each of the diverse models created for estimation, and aggregating over each of their predictions (rather than their errors), coined bootstrap aggregation or bagging. However, similar extensions of cross-validation to create diverse forecasting models have not been considered. In accordance with bagging, we propose to combine the benefits of cross-validation and forecast aggregation, i.e. crogging. We assesses different levels of cross-validation, including a (single-fold) hold-out approach, 2-fold and 10-fold cross validation and Monte-Carlos cross validation, to create diverse base-models of neural networks for time series prediction trained on different data subsets, and average their individual multiple-step ahead predictions. Results of forecasting the 111 time series of the NN3 competition indicate significant improvements accuracy through Crogging relative to Bagging or individual model selection of neural networks.

Keywords

Monte Carlo methods; decision making; forecasting theory; neural nets; pattern classification; regression analysis; time series; 10-fold cross validation; 2-fold cross validation; Monte-Carlo cross validation; bootstrap aggregation; bootstrap bagging; classification; crogging; cross-validation aggregation; data subsets; decision making; error estimation; expected accuracy estimation; forecast aggregation; forecasting model; individual model selection; model selection; neural network base models; neural network ensembles; predictive algorithm; predictive errors; regression analysis; single-fold hold-out approach; time series prediction; time series subsamples; Accuracy; Artificial neural networks; Bagging; Forecasting; Predictive models; Time series analysis; Training;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks (IJCNN), The 2013 International Joint Conference on

Conference_Location

Dallas, TX

ISSN

2161-4393

Print_ISBN

978-1-4673-6128-6

Type

conf

DOI

10.1109/IJCNN.2013.6706740

Filename

6706740