Title :
The effect of initial weight, learning rate and regularization on generalization performance and efficiency
Author :
Yan, Wu ; Limimg, Zhang
Author_Institution :
Dept. of Comput. Sci. & Eng., Tongji Univ., Shanghai, China
Abstract :
The goal of this paper is to study the factors that affect the generalization performance and efficiency for neural network learning. First, this paper investigates the effect of initial weight ranges, learning rate, and regularization coefficient on generalization performance and learning speed. Based on this, we propose a hybrid method that simultaneously considers these three factors, and dynamically tunes the learning rate and regularization coefficient. Then we present the results of some experimental comparisons among these kinds of methods in several different problems. Finally, we draw conclusions and make plans for future work.
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); neural nets; efficiency; generalization performance; hybrid method; initial weight ranges; learning rate; learning speed; neural network learning; regularization coefficient; Adaptive signal processing; Adaptive systems; Communication system control; Learning systems; Neural networks; Nonlinear control systems; Pattern recognition; Process control; Real time systems; Signal processing algorithms;
Conference_Titel :
Signal Processing, 2002 6th International Conference on
Print_ISBN :
0-7803-7488-6
DOI :
10.1109/ICOSP.2002.1180003