Toward effective combination of off-line and on-line training in ADP framework

Author

Prokhorov, Danil

Author_Institution

Toyota Tech. Center, Ann Arbor, MI

fYear

2007

fDate

1-5 April 2007

Firstpage

268

Lastpage

271

Abstract

We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of training for robustness with a group of on-line methods. Training for robustness is carried out on reasonably accurate models with the multi-stream Kalman filter method (Feldkamp et al., 1998), whereas on-line adaptation is performed either with the help of a critic or by methods resembling reinforcement learning. We also illustrate importance of using recurrent neural networks for both controller/actor and critic

Keywords

Kalman filters; dynamic programming; learning (artificial intelligence); neurocontrollers; recurrent neural nets; robust control; approximate dynamic programming; multistream Kalman filter; offline training; online training; real-time training; recurrent neural network; reinforcement learning; robustness training; Adaptive control; Dynamic programming; Learning; Neural networks; Neurocontrollers; Programmable control; Recurrent neural networks; Robust control; Robustness; Uncertainty;

fLanguage

English

Publisher

ieee

Conference_Titel

Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on

Conference_Location

Honolulu, HI

Print_ISBN

1-4244-0706-0

Type

conf

DOI

10.1109/ADPRL.2007.368198

Filename

4220843