Behavior coordination for a mobile robot using modular reinforcement learning

Author

Uchibe, Eiji ; Asada, Minoru ; Hosoda, Koh

Author_Institution

Dept. of Mech. Eng. for Comput.-Controlled Machinery, Osaka Univ., Japan

Volume

3

fYear

1996

fDate

4-8 Nov 1996

Firstpage

1329

Abstract

Coordination of multiple behaviors independently obtained by a reinforcement learning method is one of the issues in order for the method to be scaled to larger and more complex robot learning tasks. Direct combination of all the state spaces for individual modules (subtasks) needs enormous learning time, and it causes hidden states. This paper presents a method of modular learning which coordinates multiple behaviors taking account of a trade-off between learning time and performance. First, in order to reduce the learning time the whole state space is classified into two categories based on the action values separately obtained by Q learning: the area where one of the learned behaviors is directly applicable (no more learning area), and the area where learning is necessary due to competition of multiple behaviors (re-learning area). Second, hidden states are detected by model fitting to the learned action values based on the information criterion. Finally, the initial action valves in the re-learning area are adjusted so that they can be consistent with the values in the no more learning area. The method is applied to one to one soccer playing robots. Computer simulation and real robot experiments are given, to show the validity of the proposed method

Keywords

learning (artificial intelligence); mobile robots; robot programming; action values; behavior coordination; mobile robot; model fitting; modular reinforcement learning; multiple behaviors; one to one soccer playing robots; Autonomous agents; Computer simulation; Learning; Machinery; Mobile robots; Orbital robotics; Robot kinematics; Robot sensing systems; Robotics and automation; State-space methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on

Conference_Location

Osaka

Print_ISBN

0-7803-3213-X

Type

conf

DOI

10.1109/IROS.1996.568989

Filename

568989