Title :
Localized Generalization Error Model and Its Application to Architecture Selection for Radial Basis Function Neural Network
Author :
Yeung, Daniel S. ; Ng, Wing W Y ; Wang, Defeng ; Tsang, Eric C C ; Wang, Xi-Zhao
Author_Institution :
Harbin Inst.of Technol., Shenzhen
Abstract :
The generalization error bounds found by current error models using the number of effective parameters of a classifier and the number of training samples are usually very loose. These bounds are intended for the entire input space. However, support vector machine (SVM), radial basis function neural network (RBFNN), and multilayer perceptron neural network (MLPNN) are local learning machines for solving problems and treat unseen samples near the training samples to be more important. In this paper, we propose a localized generalization error model which bounds from above the generalization error within a neighborhood of the training samples using stochastic sensitivity measure. It is then used to develop an architecture selection technique for a classifier with maximal coverage of unseen samples by specifying a generalization error threshold. Experiments using 17 University of California at Irvine (UCI) data sets show that, in comparison with cross validation (CV), sequential learning, and two other ad hoc methods, our technique consistently yields the best testing classification accuracy with fewer hidden neurons and less training time.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); multilayer perceptrons; pattern classification; radial basis function networks; support vector machines; ad hoc methods; architecture selection; hidden neurons; local learning machines; localized generalization error model; multilayer perceptron neural network; radial basis function neural network; sequential learning; support vector machine; Machine learning; Multi-layer neural network; Multilayer perceptrons; Neural networks; Neurons; Radial basis function networks; Sequential analysis; Stochastic processes; Support vector machine classification; Support vector machines; Localized generalization error; network architecture selection; radial basis function neural network (RBFNN); sensitivity measure; Algorithms; Computer Simulation; Models, Statistical; Neural Networks (Computer); Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity;
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2007.894058