• DocumentCode
    1604623
  • Title

    Phased Learning with Hierarchical Reinforcement Learning in Nonholonomic Motion Control

  • Author

    Goto, Takaknuni ; Homma, Noriyasu ; Yoshizawa, Makoto ; Abe, Kenichi

  • Author_Institution
    Dept. of Electr. & Commun. Eng., Tohoku Univ.
  • fYear
    2006
  • Firstpage
    4557
  • Lastpage
    4562
  • Abstract
    In this paper, a hierarchical reinforcement learning algorithm for controlling nonholonomic systems is proposed. When applying reinforcement learning to the nonholonomic systems, acquiring adequate policies is difficult because of an increase of learning steps and a convergence of local optimal policies. The proposed algorithm is inspired by such human learning behavior. Human can learn to control such systems sufficiently even if they initially have little knowledge about the system´s dynamics and the way to control. This human capability is suggested to be caused by their exploration strategies for acquiring the adequate policies. The key element of the proposed algorithm is a shaping function defined on a novel position-direction space. The shaping function is autonomously constructed once the goal is reached and constrains the exploration area to optimize the policy. The efficiency of the proposed shaping function was demonstrated by using a nonholonomic control problem of positioning the 2-link planer underactuated manipulator
  • Keywords
    learning (artificial intelligence); manipulators; motion control; 2-link planer underactuated manipulator; human learning behavior; nonholonomic motion control; phased learning; reinforcement learning algorithm; shaping function; Communication system control; Computer science; Control systems; Convergence; Delay; Educational institutions; Humans; Learning; Medical control systems; Motion control; human learning behavior; nonholonomic systems; reinforcement learning; shaping function;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SICE-ICASE, 2006. International Joint Conference
  • Conference_Location
    Busan
  • Print_ISBN
    89-950038-4-7
  • Electronic_ISBN
    89-950038-5-5
  • Type

    conf

  • DOI
    10.1109/SICE.2006.315088
  • Filename
    4108481