Title :
A bootstrap evaluation of the effect of data splitting on financial time series
Author :
LeBaron, Blake ; Weigend, Andreas S.
Author_Institution :
Dept. of Econ., Wisconsin Univ., Madison, WI, USA
fDate :
1/1/1998 12:00:00 AM
Abstract :
Exposes problems of the commonly used technique of splitting the available data into training, validation, and test sets that are held fixed, warns about drawing too strong conclusions from such static splits, and shows potential pitfalls of ignoring variability across splits. Using a bootstrap or resampling method, we compare the uncertainty in the solution stemming from the data splitting with neural-network specific uncertainties (parameter initialization, choice of number of hidden units, etc.). We present two results on data from the New York Stock Exchange. First, the variation due to different resamplings is significantly larger than the variation due to different network conditions. This result implies that it is important to not over-interpret a model (or an ensemble of models) estimated on one specific split of the data. Second, on each split, the neural-network solution with early stopping is very close to a linear model; no significant nonlinearities are extracted
Keywords :
finance; learning (artificial intelligence); neural nets; stock markets; time series; New York Stock Exchange; bootstrap evaluation; data splitting; financial time series; neural-network specific uncertainties; resampling method; static splits; Data mining; Economic forecasting; Information systems; Merging; Parameter estimation; Power generation economics; Predictive models; Stock markets; Testing; Uncertainty;
Journal_Title :
Neural Networks, IEEE Transactions on