DocumentCode
2717636
Title
Toward effective combination of off-line and on-line training in ADP framework
Author
Prokhorov, Danil
Author_Institution
Toyota Tech. Center, Ann Arbor, MI
fYear
2007
fDate
1-5 April 2007
Firstpage
268
Lastpage
271
Abstract
We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of training for robustness with a group of on-line methods. Training for robustness is carried out on reasonably accurate models with the multi-stream Kalman filter method (Feldkamp et al., 1998), whereas on-line adaptation is performed either with the help of a critic or by methods resembling reinforcement learning. We also illustrate importance of using recurrent neural networks for both controller/actor and critic
Keywords
Kalman filters; dynamic programming; learning (artificial intelligence); neurocontrollers; recurrent neural nets; robust control; approximate dynamic programming; multistream Kalman filter; offline training; online training; real-time training; recurrent neural network; reinforcement learning; robustness training; Adaptive control; Dynamic programming; Learning; Neural networks; Neurocontrollers; Programmable control; Recurrent neural networks; Robust control; Robustness; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
Conference_Location
Honolulu, HI
Print_ISBN
1-4244-0706-0
Type
conf
DOI
10.1109/ADPRL.2007.368198
Filename
4220843
Link To Document