• DocumentCode
    2717636
  • Title

    Toward effective combination of off-line and on-line training in ADP framework

  • Author

    Prokhorov, Danil

  • Author_Institution
    Toyota Tech. Center, Ann Arbor, MI
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    268
  • Lastpage
    271
  • Abstract
    We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of training for robustness with a group of on-line methods. Training for robustness is carried out on reasonably accurate models with the multi-stream Kalman filter method (Feldkamp et al., 1998), whereas on-line adaptation is performed either with the help of a critic or by methods resembling reinforcement learning. We also illustrate importance of using recurrent neural networks for both controller/actor and critic
  • Keywords
    Kalman filters; dynamic programming; learning (artificial intelligence); neurocontrollers; recurrent neural nets; robust control; approximate dynamic programming; multistream Kalman filter; offline training; online training; real-time training; recurrent neural network; reinforcement learning; robustness training; Adaptive control; Dynamic programming; Learning; Neural networks; Neurocontrollers; Programmable control; Recurrent neural networks; Robust control; Robustness; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Approximate Dynamic Programming and Reinforcement Learning, 2007. ADPRL 2007. IEEE International Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0706-0
  • Type

    conf

  • DOI
    10.1109/ADPRL.2007.368198
  • Filename
    4220843