• DocumentCode
    2260476
  • Title

    Training feedforward neural networks using orthogonal iteration of the Hessian eigenvectors

  • Author

    Hunter, Andrew

  • Author_Institution
    Dept. of Comput. & Eng. Technol., Sunderland Univ., UK
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    173
  • Abstract
    The paper describes a training algorithm for multilayer perceptrons. It has scalable memory requirements, which may range from O(W) to O(W2), although in practice the useful range is limited to lower complexity levels. The algorithm is based upon a novel iterative estimation of the principal eigensubspace of the Hessian, together with a quadratic step estimation procedure. It is shown that the new algorithm has convergence time comparable to conjugate gradient descent, and may be preferable if early stopping is used as it converges more quickly during the initial phases. Results of experiments to confirm the algorithm´s performance are presented
  • Keywords
    Hessian matrices; computational complexity; convergence; eigenvalues and eigenfunctions; feedforward neural nets; iterative methods; learning (artificial intelligence); multilayer perceptrons; Hessian eigenvectors; convergence time; feedforward neural network training; iterative estimation; multilayer perceptrons; orthogonal iteration; principal eigensubspace; quadratic step estimation procedure; scalable memory requirements; Approximation algorithms; Computer networks; Convergence; Eigenvalues and eigenfunctions; Feedforward neural networks; Gradient methods; Multi-layer neural network; Neural networks; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
  • Conference_Location
    Como
  • ISSN
    1098-7576
  • Print_ISBN
    0-7695-0619-4
  • Type

    conf

  • DOI
    10.1109/IJCNN.2000.857893
  • Filename
    857893