DocumentCode
3220050
Title
Primitive adaptive critics
Author
Prokhorov, Danil V. ; Feldkamp, Lee A.
Author_Institution
Dept. of Electr. Eng., Texas Tech. Univ., Lubbock, TX, USA
Volume
4
fYear
1997
fDate
9-12 Jun 1997
Firstpage
2263
Abstract
We propose a simple framework for critic-based training of recurrent neural networks and feedback controllers. We call the critics primitive adaptive critics because we represent them with the simplest possible architecture (a bias weight only). We derive this framework from two main premises. The first is a natural similarity, which we discuss, between dual heuristic programming, a form of approximate dynamic programming, and backpropagation through time (BPTT). The second is our emphasis on developing a truly online critic-based training procedure that is competitive with truncated BPTT in performance and computational cost. Three examples illustrate the main features of the proposed framework.
Keywords
backpropagation; duality (mathematics); dynamic programming; feedback; model reference adaptive control systems; neurocontrollers; real-time systems; recurrent neural nets; approximate dynamic programming; backpropagation through time; critic-based learning; dual heuristic programming; feedback controllers; model reference adaptive control; online learning; primitive adaptive critics; recurrent neural networks; Adaptive control; Backpropagation; Computational efficiency; Computational intelligence; Cost function; Dynamic programming; Equations; Function approximation; Laboratories; Recurrent neural networks;
fLanguage
English
Publisher
IEEE
Conference_Titel
Neural Networks, 1997, International Conference on
Conference_Location
Houston, TX
Print_ISBN
0-7803-4122-8
Type
conf
DOI
10.1109/ICNN.1997.614396
Filename
614396