• DocumentCode
    1625946
  • Title

    A new method based on determining error surface for designing three layer neural networks

  • Author

    Lu, Baiquan ; Hirasawa, Kotaro ; Murata, Junichi ; Hu, Jinlu ; Jin, ChunZhin

  • Author_Institution
    Dept. of Electr. & Electron. Syst. Eng., Kyushu Univ., Fukuoka, Japan
  • Volume
    3
  • fYear
    1999
  • fDate
    6/21/1905 12:00:00 AM
  • Firstpage
    384
  • Abstract
    A method is proposed for designing three layer neural networks that assures global minimization of errors. The commonly used gradient-based learning algorithm suffers form the local minima problem, however, it can be solved if the error surface becomes convex. In the paper a number of possible network structures are provided together with their gradient-based learning algorithms. For a given set of training data, an appropriate network structure, i.e. the number of hidden nodes, the types of activation function, and the connections between them, is determined. All of the proposed structures give convex error surfaces and thus solve the local minima problem. The difference between them is in the level of locality and generalization ability. A numerical example is provided that supports the present approach
  • Keywords
    gradient methods; learning (artificial intelligence); matrix algebra; minimisation; multilayer perceptrons; neural net architecture; activation function; convex error surfaces; generalization ability; global minimization; gradient-based learning algorithm; hidden nodes; local minima problem; locality; network structures; three layer neural networks; Artificial neural networks; Design engineering; Design methodology; Electronic mail; Equations; Information science; Minimization methods; Neural networks; Systems engineering and theory; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-5731-0
  • Type

    conf

  • DOI
    10.1109/ICSMC.1999.823235
  • Filename
    823235