• DocumentCode
    330296
  • Title

    The square root Kalman filter training of recurrent neural networks

  • Author

    Sun, Pu ; Marko, Kenneth

  • Author_Institution
    Ford Res. Lab., Ford Motor Co., Dearborn, MI, USA
  • Volume
    2
  • fYear
    1998
  • fDate
    11-14 Oct 1998
  • Firstpage
    1645
  • Abstract
    The conventional Kalman filter suffers from the problem that the covariance matrix may not remain positive definite. In using the filter to train neural networks, the consequence of this problem is both a loss of efficiency during training and an eventual blow up. This problem has long been recognized in the application of the filter in signal processing and several forms of a square root Kalman filter have been derived to overcome it. However, the existing square root Kalman filter algorithms cannot incorporate the learning rate used in neural network training. In many situations, the incorporation of the learning rate into the training algorithms is crucial to obtaining excellent results in training time lagged recurrent neural networks (TLRNN). In this paper, a new square root Kalman filter equation is derived to train TLRNN which allows a user to incorporate the learning rate into the neural network training. With this new square root Kalman filter algorithm, we are able to train a neural network to convergence on a large and complex problem related to misfire diagnostics over a very large number of cycles, to produce extremely high classification accuracy on the diagnostic task. This training proceeded without any training problems we often experienced when a standard Kalman filter training algorithm was used for identical initializations and structures of the network. Furthermore, the neural network trained by the new method outperforms the one trained with the conventional Kalman filter algorithm by almost a factor of two
  • Keywords
    Kalman filters; covariance matrices; delays; filtering theory; learning (artificial intelligence); recurrent neural nets; TLRNN; blow-up; classification accuracy; convergence; covariance matrix; efficiency loss; initializations; misfire diagnostics; recurrent neural network training; square root Kalman filter; time lagged recurrent neural networks; Convergence; Covariance matrix; Decision theory; Equations; Estimation theory; Laboratories; Neural networks; Recurrent neural networks; Signal processing algorithms; Sun;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1998. 1998 IEEE International Conference on
  • Conference_Location
    San Diego, CA
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-4778-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1998.728125
  • Filename
    728125