• DocumentCode
    328301
  • Title

    A new approach of adaptive reinforcement learning control

  • Author

    Yang, Boo-Ho ; Asada, Haruhiko

  • Author_Institution
    Dept. of Mech. Eng., MIT, Cambridge, MA, USA
  • Volume
    1
  • fYear
    1993
  • fDate
    25-29 Oct. 1993
  • Firstpage
    627
  • Abstract
    A new learning algorithm for connectionist networks that solves a class of optimal control problems is presented. The algorithm, called adaptive reinforcement learning algorithm, employs a second network to model immediate reinforcement provided from the task environment and adaptively identify it through experience. Output perturbation and correlation techniques are used to translate mere critic signals into useful learning signals for the connectionist controller. Compared with the direct approaches of reinforcement learning, this algorithm shows faster and guaranteed improvement in the control performance. Robustness against inaccuracy of the model is also discussed.
  • Keywords
    adaptive control; correlation methods; intelligent control; learning (artificial intelligence); neural nets; optimal control; adaptive reinforcement learning control; connectionist networks; critic signals; neural nets; optimal control; output correlation; output perturbation; Adaptive control; Control systems; Force measurement; Force sensors; Learning; Mechanical engineering; Mechanical systems; Optimal control; Programmable control; Robotic assembly;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1993. IJCNN '93-Nagoya. Proceedings of 1993 International Joint Conference on
  • Print_ISBN
    0-7803-1421-2
  • Type

    conf

  • DOI
    10.1109/IJCNN.1993.713993
  • Filename
    713993