Vision-based reinforcement learning for humanoid behavior generation with rhythmic walking parameters

Author

Ogino, Masaki ; Katoh, Yutaka ; Aono, Masahiro ; Asada, Minoru ; Hosoda, Koh

Author_Institution

Dept. of Adaptive Machine Syst., Osaka Univ., Japan

Volume

2

fYear

2003

fDate

27-31 Oct. 2003

Firstpage

1665

Abstract

This paper presents a method for generating vision-based humanoid behaviors by reinforcement learning with rhythmic walking parameters. The walking is stabilized by a rhythmic motion controller such as CPG or neural oscillator. The learning process consists of two stages: the first one is building an action space with two parameters (a forward step length and a turning angle) that inhibits combinations that are not feasible. The second is reinforcement learning with the constructed action space and the state space consisting of visual features and posture parameters to find feasible actions. The method is applied to a situation of the RoboCupSoccer humanoid league [H. Kitano and M. Asada, Advanced Robotics, 2000], that is, to approach the ball and to shoot it into the goal. Instructions by human are given to start up the learning process and the rest is completely self-learning in real situations.

Keywords

learning (artificial intelligence); legged locomotion; motion control; neurocontrollers; position control; robot vision; RoboCupSoccer humanoid league; action space; central pattern generator; neural oscillator; posture parameters; reinforcement learning; rhythmic motion controller; rhythmic walking parameters; state space; vision-based humanoid behaviors; visual features; Adaptive systems; Buildings; Humans; Learning; Legged locomotion; Motion control; Oscillators; State-space methods; Trajectory; Turning;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Robots and Systems, 2003. (IROS 2003). Proceedings. 2003 IEEE/RSJ International Conference on

Print_ISBN

0-7803-7860-1

Type

conf

DOI

10.1109/IROS.2003.1248883

Filename

1248883