Analysis of gradient descent learning algorithms for multilayer feedforward neural networks

Author

Guo, Heng ; Gelfand, Saul B.

Author_Institution

Sch. of Electr. Eng., Purdue Univ., West Lafayette, IN, USA

Volume

38

Issue

8

fYear

1991

fDate

8/1/1991 12:00:00 AM

Firstpage

883

Lastpage

894

Abstract

Certain dynamical properties of gradient-type learning algorithms as they apply to multilayer feedforward neural networks are investigated. These properties are more related to the multilayer structure of the net than to the particular threshold units at the nodes. The analysis explains the empirical observation that the weight sequence generated by backpropagation and related stochastic gradient algorithms exhibits a long-term dependence on the initial choice of weights, and also a continued growth and/or drift long after the outputs have converged. The analysis is carried out in two steps. First, a simplified deterministic algorithm is derived using a describing function-type approach. Next, an analysis of the simplified algorithm is performed by considering an associated ordinary differential equation (ODE). Some numerical examples are given to illustrate the analysis. The dynamical behavior of backpropagation and related algorithms for the training of multilayer nets is discussed

Keywords

neural nets; backpropagation; dynamical properties; gradient descent learning algorithms; multilayer feedforward neural networks; numerical examples; ordinary differential equation; training; Algorithm design and analysis; Backpropagation algorithms; Equations; Feedforward neural networks; Multi-layer neural network; Neural networks; Neurons; Nonhomogeneous media; Performance analysis; Stochastic processes;

fLanguage

English

Journal_Title

Circuits and Systems, IEEE Transactions on

Publisher

ieee

ISSN

0098-4094

Type

jour

DOI

10.1109/31.85630

Filename

85630