Title :
A new method based on determining error surface for designing three layer neural networks
Author :
Lu, Baiquan ; Hirasawa, Kotaro ; Murata, Junichi ; Hu, Jinlu ; Jin, ChunZhin
Author_Institution :
Dept. of Electr. & Electron. Syst. Eng., Kyushu Univ., Fukuoka, Japan
fDate :
6/21/1905 12:00:00 AM
Abstract :
A method is proposed for designing three layer neural networks that assures global minimization of errors. The commonly used gradient-based learning algorithm suffers form the local minima problem, however, it can be solved if the error surface becomes convex. In the paper a number of possible network structures are provided together with their gradient-based learning algorithms. For a given set of training data, an appropriate network structure, i.e. the number of hidden nodes, the types of activation function, and the connections between them, is determined. All of the proposed structures give convex error surfaces and thus solve the local minima problem. The difference between them is in the level of locality and generalization ability. A numerical example is provided that supports the present approach
Keywords :
gradient methods; learning (artificial intelligence); matrix algebra; minimisation; multilayer perceptrons; neural net architecture; activation function; convex error surfaces; generalization ability; global minimization; gradient-based learning algorithm; hidden nodes; local minima problem; locality; network structures; three layer neural networks; Artificial neural networks; Design engineering; Design methodology; Electronic mail; Equations; Information science; Minimization methods; Neural networks; Systems engineering and theory; Training data;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.823235