DocumentCode
2263052
Title
Neural network learning through optimally conditioned quadratically convergent methods requiring NO LINE SEARCH
Author
Beigi, Homayoon S M
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
1993
fDate
16-18 Aug 1993
Firstpage
109
Abstract
Neural network learning algorithms based on conjugate gradient techniques and quasi Newton techniques such as Broyden, DFP, BFGS, and SSVM algorithms require exact or inexact line searches in order to satisfy their convergence criteria. Line searches are very costly and slow down the learning process. This paper presents new neural network learning algorithms based on Hoshino´s weak line search technique and Davidon´s optimally conditioned line search free technique. Also, a practical method of using these optimization algorithms is presented such that they will avoid getting trapped in local minima for the most part. The global minimization problem is a serious one when quadratically convergent techniques such as quasi Newton methods are used. Furthermore, to display the performance of the proposed learning algorithms, the more practical algorithm based on Davidon´s minimization technique is used in conjunction with a cursive handwriting recognition problem. For comparison with other algorithms, also a few small benchmark tests are conducted and reported
Keywords
Newton method; convergence of numerical methods; feedforward neural nets; handwriting recognition; learning (artificial intelligence); minimisation; optimisation; Davidon minimization technique; conjugate gradient techniques; convergence criteria; cursive handwriting recognition problem; global minimization problem; line search elimination; neural network learning algorithms; optimally conditioned line search free technique; optimally conditioned quadratically convergent methods; optimization algorithms; quasi Newton techniques; weak line search technique; Backpropagation algorithms; Convergence; Displays; Electronic mail; Handwriting recognition; Minimization methods; Neural networks; Newton method; Optical character recognition software; Optimization methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 1993., Proceedings of the 36th Midwest Symposium on
Conference_Location
Detroit, MI
Print_ISBN
0-7803-1760-2
Type
conf
DOI
10.1109/MWSCAS.1993.343053
Filename
343053
Link To Document