Title :
Aitken-based acceleration methods for assessing convergence of multilayer neural networks
Author :
Pilla, Ramani S. ; Kamarthi, Sagar V. ; Lindsay, Bruce G.
Author_Institution :
Div. of Epidemiology & Biostat., Illinois Univ., Chicago, IL, USA
Date :
9/1/2001 12:00:00 AM
Abstract :
This paper first develops the ideas of the Aitken δ² method to accelerate the rate of convergence of an error sequence (the value of the objective function at each step) obtained by training a neural network with a sigmoidal activation function via the backpropagation algorithm. The Aitken method is exact when the error sequence is exactly geometric. However, theoretical and empirical evidence suggests that the best possible rate of convergence obtainable for such an error sequence is log-geometric. This paper develops a new invariant extended-Aitken acceleration method for accelerating log-geometric sequences. The resulting accelerated sequence enables one to predict the final value of the error function. These predictions can in turn be used to assess the distance between the current and final solutions and thereby provide a stopping criterion for a desired accuracy. Each of the techniques described is applicable to a wide range of problems. The invariant extended-Aitken acceleration approach shows improved acceleration as well as outstanding prediction of the final error in the practical problems considered.
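The classical Aitken δ² transform underlying the abstract can be sketched as follows. This is a minimal illustration of the standard transform on a synthetic geometric error sequence, not the paper's extended-Aitken method; the sequence values (L, c, r) are assumed for demonstration. For an exactly geometric sequence e_n = L + c·rⁿ the transform recovers the limit L in one step, which is the "exact" case the abstract refers to.

```python
def aitken_delta2(s):
    """Apply the Aitken delta-squared transform to a sequence s.

    t_n = s_n - (s_{n+1} - s_n)^2 / (s_{n+2} - 2*s_{n+1} + s_n)

    Returns the accelerated sequence (two elements shorter than s).
    """
    t = []
    for n in range(len(s) - 2):
        denom = s[n + 2] - 2 * s[n + 1] + s[n]
        if denom == 0:
            # Sequence is locally linear/converged; keep the latest value.
            t.append(s[n + 2])
        else:
            t.append(s[n] - (s[n + 1] - s[n]) ** 2 / denom)
    return t

# Synthetic geometric error sequence e_n = L + c*r^n with assumed
# limit L = 0.5, amplitude c = 2.0, and ratio r = 0.8.
L, c, r = 0.5, 2.0, 0.8
errors = [L + c * r ** n for n in range(10)]

accelerated = aitken_delta2(errors)
# Because the input is exactly geometric, every accelerated term
# equals the limit L (up to floating-point rounding).
```

For a training-error sequence that is only approximately geometric, the accelerated terms approach the final error faster than the raw sequence, which is what makes the transform usable as a predictor for a stopping criterion.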
Keywords :
backpropagation; convergence of numerical methods; extrapolation; feedforward neural nets; minimisation; Aitken method; convergence; cross validation; error sequence convergence; learning; log-geometric sequences; multilayer neural networks; Acceleration; Backpropagation algorithms; Closed-form solution; Convergence; Force measurement; Iterative algorithms; Iterative methods; Multi-layer neural network; Neural networks; Tail;
Journal_Title :
Neural Networks, IEEE Transactions on