DocumentCode
1077763
Title
On the problem of local minima in backpropagation
Author
Gori, Marco ; Tesi, Alberto
Author_Institution
Dipartimento di Sistemi e Inf., Firenze Univ., Italy
Volume
14
Issue
1
fYear
1992
fDate
1/1/1992 12:00:00 AM
Firstpage
76
Lastpage
86
Abstract
The authors propose a theoretical framework for backpropagation (BP) in order to identify some of its limitations as a general learning procedure and the reasons for its success in several experiments on pattern recognition. The first important conclusion is that examples can be found in which BP gets stuck in local minima. A simple example in which BP can get stuck during gradient descent without having learned the entire training set is presented. This example guarantees the existence of a solution with null cost. Some conditions on the network architecture and the learning environment that ensure the convergence of the BP algorithm are proposed. It is proven in particular that the convergence holds if the classes are linearly separable. In this case, the experience gained in several experiments shows that multilayered neural networks (MLNs) exceed perceptrons in generalization to new examples
Keywords
learning systems; neural nets; pattern recognition; backpropagation; convergence; learning systems; local minima; multilayered neural networks; network architecture; pattern recognition; perceptrons; Backpropagation algorithms; Convergence; Gradient methods; Intelligent networks; Multi-layer neural network; Neural networks; Neurons; Pattern recognition; Speech recognition; Workstations;
fLanguage
English
Journal_Title
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher
ieee
ISSN
0162-8828
Type
jour
DOI
10.1109/34.107014
Filename
107014
Link To Document