DocumentCode :
2134743
Title :
Reinforcement learning of dynamic motor sequence: learning to stand up
Author :
Morimoto, Jun ; Doya, Kenji
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume :
3
fYear :
1998
fDate :
13-17 Oct 1998
Firstpage :
1721
Abstract :
We propose a learning method for implementing human-like sequential movements in robots. As an example of dynamic sequential movement, we consider the “stand-up” task for a two-joint, three-link robot. In contrast to the case of steady walking or standing, the desired trajectory for such a transient behavior is very difficult to derive. The goal of the task is to find a path that links a lying state to an upright state under the constraints of the system dynamics. The geometry of the robot is such that there is no static solution; the robot has to stand up dynamically utilizing the momentum of its body. We use reinforcement learning, in particular, a continuous time and state temporal difference (TD) learning method. For successful results, we use 1) an efficient method of value function approximation in a high-dimensional state space, and 2) a hierarchical architecture which divides a large state space into a few smaller pieces
Keywords :
function approximation; learning (artificial intelligence); legged locomotion; neural nets; robot dynamics; continuous time state temporal difference learning method; dynamic motor sequence; hierarchical architecture; high-dimensional state space; human-like sequential movements; lying state; reinforcement learning; stand-up task; transient behavior; two-joint three-link robot; upright state; value function approximation; Computational geometry; Computer architecture; Computer simulation; Function approximation; Humanoid robots; Humans; Information science; Learning systems; Orbital robotics; State-space methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 1998. Proceedings., 1998 IEEE/RSJ International Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-4465-0
Type :
conf
DOI :
10.1109/IROS.1998.724846
Filename :
724846
Link To Document :
بازگشت