• DocumentCode
    3220050
  • Title

    Primitive adaptive critics

  • Author

    Prokhorov, Danil V. ; Feldkamp, Lee A.

  • Author_Institution
    Dept. of Electr. Eng., Texas Tech. Univ., Lubbock, TX, USA
  • Volume
    4
  • fYear
    1997
  • fDate
    9-12 Jun 1997
  • Firstpage
    2263
  • Abstract
    We propose a simple framework for critic-based training of recurrent neural networks and feedback controllers. We term the critics that are used primitive adaptive critics, since we represent them with the simplest possible architecture (bias weight only). We derive this framework from two main premises. The first of these is a natural similarity between a form of approximate dynamic programming, called dual heuristic programming, and backpropagation through time (BPTT), which are discussed. The second premise is our emphasis on a development of a truly online critic-based training procedure competitive in performance and computational cost to truncated BPTT. Three examples illustrate the main features of the framework proposed
  • Keywords
    backpropagation; duality (mathematics); dynamic programming; feedback; model reference adaptive control systems; neurocontrollers; real-time systems; recurrent neural nets; approximate dynamic programming; backpropagation through time; critic-based learning; dual heuristic programming; feedback controllers; model reference adaptive control; online learning; primitive adaptive critics; recurrent neural networks; Adaptive control; Backpropagation; Computational efficiency; Computational intelligence; Cost function; Dynamic programming; Equations; Function approximation; Laboratories; Recurrent neural networks;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks,1997., International Conference on
  • Conference_Location
    Houston, TX
  • Print_ISBN
    0-7803-4122-8
  • Type

    conf

  • DOI
    10.1109/ICNN.1997.614396
  • Filename
    614396