• DocumentCode
    1360127
  • Title

    Reinforcement learning and adaptive dynamic programming for feedback control

  • Author

    Lewis, F.L. ; Vrabie, D.

  • Author_Institution
    Univ. of Texas at Arlington, Arlington, TX, USA
  • Volume
    9
  • Issue
    3
  • fYear
    2009
  • Firstpage
    32
  • Lastpage
    50
  • Abstract
    Living organisms learn by acting on their environment, observing the resulting reward stimulus, and adjusting their actions accordingly to improve the reward. This action-based or reinforcement learning can capture notions of optimal behavior occurring in natural systems. We describe mathematical formulations for reinforcement learning and a practical implementation method known as adaptive dynamic programming. These give us insight into the design of controllers for man-made engineered systems that both learn and exhibit optimal behavior.
  • Keywords
    control engineering computing; control system synthesis; dynamic programming; feedback; learning (artificial intelligence); optimal control; adaptive dynamic programming; controller design; feedback control; man-made engineered system; natural system; optimal behavior; reinforcement learning; reward stimulus; Adaptive control; Control systems; Design engineering; Dynamic programming; Feedback control; Learning; Optimal control; Organisms; Programmable control; Systems engineering and theory;
  • fLanguage
    English
  • Journal_Title
    Circuits and Systems Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1531-636X
  • Type

    jour

  • DOI
    10.1109/MCAS.2009.933854
  • Filename
    5227780