DocumentCode :
2114023
Title :
Realtime reinforcement learning for a real robot in the real environment
Author :
Yamaguchi, Tomohiro ; Masubuchi, Motohmi ; Ihara, Kamtsugu Fuj ; Achida, Masahiko Y.
Author_Institution :
Fac. of Eng. Sci., Osaka Univ., Japan
Volume :
3
fYear :
1996
fDate :
4-8 Nov 1996
Firstpage :
1321
Abstract :
For a real robot to acquire behaviors, it is important for it to learn in a real environment. Most reinforcement learning research has been made by simulation because real-environment learning requires large computation costs as well as a lot of time. Realizing reinforcement learning of a physical robot in a real environment requires both an adaptation for the diversity of possible situations and a high-speed learning method that can learn from fewer trials. This paper describes the realtime reinforcement learning for a real robot in the real environment based on the exploitation oriented reinforcement learning method where the learning cost is very small and has strict incrementality to realize realtime reinforcement learning with an automated sub-rewards generation method achieved by the abstraction of the task state to accelerate the learning process. Successive learning experiments in the real environment for the ball pushing task for the real robot are performed
Keywords :
intelligent control; learning (artificial intelligence); robot programming; automated sub-rewards generation method; ball pushing task; behaviors acquisition; exploitation oriented reinforcement learning method; high-speed learning method; real environment; realtime reinforcement learning; Artificial intelligence; Computational efficiency; Computational modeling; Computer simulation; Costs; Intelligent robots; Learning systems; Robotics and automation; Systems engineering and theory; Virtual environment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on
Conference_Location :
Osaka
Print_ISBN :
0-7803-3213-X
Type :
conf
DOI :
10.1109/IROS.1996.568988
Filename :
568988
Link To Document :
بازگشت