Title :
Optimal initialization for multi-layer perceptrons
Author :
Chen, Yiwei ; Bastani, Farokh
Author_Institution :
Western Geophys., Houston, TX, USA
Abstract :
An optimal weight initialization algorithm for three-layer networks, called the Walsh-Hermite algorithm, is introduced. The weights between the input layer and the hidden layer are vectors of the Hadamard matrix. Each neuron of the hidden layer has a different discriminator function, which is a base function of the Hermite spline of degree three. The weights between the hidden layer and the third layer are scaled coefficients of Hermite interpolations. Experiments show that the convergence speed of the network with weights so initialized is quite close to that of the best initialized net. Compared to the optimal estimate training algorithm, the Walsh-Hermite algorithm is theoretically more reasonable, is cost effective, and does not confine the number of units in the hidden layer
Keywords :
convergence of numerical methods; interpolation; neural nets; optimisation; parallel algorithms; splines (mathematics); Hadamard matrix; Hermite interpolations; Hermite spline; Walsh-Hermite algorithm; multilayer perceptron; neural nets; optimal weight initialization algorithm; three-layer networks; Artificial neural networks; Biology computing; Convergence; Estimation theory; Geophysics computing; Interpolation; Marine vehicles; Multilayer perceptrons; Neurons; Spline;
Conference_Titel :
Systems, Man and Cybernetics, 1990. Conference Proceedings., IEEE International Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
0-87942-597-0
DOI :
10.1109/ICSMC.1990.142129