DocumentCode
3290520
Title
Learning strategy fusion for acquiring crawling behavior in multiple environments
Author
Yamaguchi, Akira ; Takamatsu, Jun ; Ogasawara, T.
Author_Institution
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
fYear
2013
fDate
12-14 Dec. 2013
Firstpage
605
Lastpage
612
Abstract
Though a reinforcement learning method is considered as a promising method for learning a robot´s behavior from reward signals and adapting it for unknown environment, a standard reinforcement learning method is for a single environment. In this paper, to make a robot working in wider environments, we develop a reinforcement learning method for (1) estimating the current environment, (2) choosing a suitable policy for a known environment, and (3) making learning efficient when learning in a new environment by using transfer learning. To achieve them, we extend the learning strategy (LS) fusion method [1]. LS fusion is a method to learn multiple policies for a single task by applying multiple learning strategies (LSs) step by step. The key idea of environment estimation is using reward statistics of learned policies. For efficient learning, we design a learning strategy to transfer a policy learned in a different environment to one for the current environment. To verify the proposed method, we conducted some experiments where a small size humanoid robot learned a crawling task in several kinds of environments.
Keywords
humanoid robots; learning (artificial intelligence); sensor fusion; statistics; LS fusion method; crawling behavior acquisition; humanoid robot; learning strategy fusion method; multiple learning strategies; reinforcement learning method; reward signals; reward statistics; robot behavior learning; transfer learning; Aerospace electronics; Estimation; Joints; Learning (artificial intelligence); Radiation detectors; Robots; Standards;
fLanguage
English
Publisher
ieee
Conference_Titel
Robotics and Biomimetics (ROBIO), 2013 IEEE International Conference on
Conference_Location
Shenzhen
Type
conf
DOI
10.1109/ROBIO.2013.6739526
Filename
6739526
Link To Document