Title :
Neural-network design for small training sets of high dimension
Author :
Yuan, Jen-Lun ; Fine, Terrence L.
Author_Institution :
Nat. Chung-Hsing Univ., Taichung, Taiwan
fDate :
3/1/1998 12:00:00 AM
Abstract :
We introduce a statistically based methodology for the design of neural networks when the dimension d of the network input is comparable to the size n of the training set. If one proceeds straightforwardly, then one is committed to a network of complexity exceeding n. The result will be good performance on the training set but poor generalization performance when the network is presented with new data. To avoid this we need to select carefully the network architecture, including control over the input variables. Our approach to selecting a network architecture first selects a subset of input variables (features) using the nonparametric statistical process of difference-based variance estimation and then selects a simple network architecture using projection pursuit regression (PPR) ideas combined with the statistical idea of slicing inverse regression (SIR). The resulting network, which is then retrained without regard to the PPR/SIR determined parameters, is one of moderate complexity (number of parameters significantly less than n) whose performance on the training set can be expected to generalize well. The application of this methodology is illustrated in detail in the context of short-term forecasting of the demand for electric power from an electric utility
Keywords :
learning (artificial intelligence); load forecasting; neural net architecture; nonparametric statistics; statistical analysis; difference-based variance estimation; electric power demand; generalization performance; high dimension small training sets; network architecture; neural network design; nonparametric statistical process; projection pursuit regression; short-term forecasting; slicing inverse regression; statistically based methodology; Bayesian methods; Design methodology; Economic forecasting; Input variables; Load forecasting; Neural networks; Power generation economics; Power industry; Statistics; Training data;
Journal_Title :
Neural Networks, IEEE Transactions on