• DocumentCode
    2114298
  • Title

    Behavior coordination for a mobile robot using modular reinforcement learning

  • Author

    Uchibe, Eiji ; Asada, Minoru ; Hosoda, Koh

  • Author_Institution
    Dept. of Mech. Eng. for Comput.-Controlled Machinery, Osaka Univ., Japan
  • Volume
    3
  • fYear
    1996
  • fDate
    4-8 Nov 1996
  • Firstpage
    1329
  • Abstract
    Coordination of multiple behaviors independently obtained by a reinforcement learning method is one of the issues in order for the method to be scaled to larger and more complex robot learning tasks. Direct combination of all the state spaces for individual modules (subtasks) needs enormous learning time, and it causes hidden states. This paper presents a method of modular learning which coordinates multiple behaviors taking account of a trade-off between learning time and performance. First, in order to reduce the learning time the whole state space is classified into two categories based on the action values separately obtained by Q learning: the area where one of the learned behaviors is directly applicable (no more learning area), and the area where learning is necessary due to competition of multiple behaviors (re-learning area). Second, hidden states are detected by model fitting to the learned action values based on the information criterion. Finally, the initial action valves in the re-learning area are adjusted so that they can be consistent with the values in the no more learning area. The method is applied to one to one soccer playing robots. Computer simulation and real robot experiments are given, to show the validity of the proposed method
  • Keywords
    learning (artificial intelligence); mobile robots; robot programming; action values; behavior coordination; mobile robot; model fitting; modular reinforcement learning; multiple behaviors; one to one soccer playing robots; Autonomous agents; Computer simulation; Learning; Machinery; Mobile robots; Orbital robotics; Robot kinematics; Robot sensing systems; Robotics and automation; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on
  • Conference_Location
    Osaka
  • Print_ISBN
    0-7803-3213-X
  • Type

    conf

  • DOI
    10.1109/IROS.1996.568989
  • Filename
    568989