DocumentCode
1366263
Title
Analysis of gradient descent learning algorithms for multilayer feedforward neural networks
Author
Guo, Heng ; Gelfand, Saul B.
Author_Institution
Sch. of Electr. Eng., Purdue Univ., West Lafayette, IN, USA
Volume
38
Issue
8
fYear
1991
fDate
8/1/1991 12:00:00 AM
Firstpage
883
Lastpage
894
Abstract
Certain dynamical properties of gradient-type learning algorithms as they apply to multilayer feedforward neural networks are investigated. These properties are more related to the multilayer structure of the net than to the particular threshold units at the nodes. The analysis explains the empirical observation that the weight sequence generated by backpropagation and related stochastic gradient algorithms exhibits a long-term dependence on the initial choice of weights, and also a continued growth and/or drift long after the outputs have converged. The analysis is carried out in two steps. First, a simplified deterministic algorithm is derived using a describing function-type approach. Next, an analysis of the simplified algorithm is performed by considering an associated ordinary differential equation (ODE). Some numerical examples are given to illustrate the analysis. The dynamical behavior of backpropagation and related algorithms for the training of multilayer nets is discussed
Keywords
neural nets; backpropagation; dynamical properties; gradient descent learning algorithms; multilayer feedforward neural networks; numerical examples; ordinary differential equation; training; Algorithm design and analysis; Backpropagation algorithms; Equations; Feedforward neural networks; Multi-layer neural network; Neural networks; Neurons; Nonhomogeneous media; Performance analysis; Stochastic processes;
fLanguage
English
Journal_Title
Circuits and Systems, IEEE Transactions on
Publisher
ieee
ISSN
0098-4094
Type
jour
DOI
10.1109/31.85630
Filename
85630
Link To Document