DocumentCode :
2334505
Title :
Constructing continuous action space from basis functions for fast and stable reinforcement learning
Author :
Yamaguchi, Akihiko ; Takamatsu, Jun ; Ogasawara, Tsukasa
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Ikoma, Japan
fYear :
2009
fDate :
Sept. 27 2009-Oct. 2 2009
Firstpage :
401
Lastpage :
407
Abstract :
This paper presents a new continuous action space for reinforcement learning (RL) with the wire-fitting. The wire-fitting has a desirable feature to be used with action value function based RL algorithms. However, the wire-fitting becomes unstable caused by changing the parameters of actions. Furthermore, the acquired behavior highly depend on the initial values of the parameters. The proposed action space is expanded from the DCOB, proposed by Yamaguchi et al., where the discrete action set is generated from given basis functions. Based on the DCOB, we apply some constraints to the parameters in order to obtain stability. Furthermore, we also describe a proper way to initialize the parameters. The simulation results demonstrate that the proposed method outperforms the wire-fitting. On the other hand, the resulting performance of the proposed method is the same as, or inferior to the DCOB. This paper also discuss about this result.
Keywords :
curve fitting; learning (artificial intelligence); stability; action value function; continuous action space; discrete action set; reinforcement learning; wire-fitting; Human robot interaction; Humanoid robots; Information science; Learning; Legged locomotion; Orbital robotics; Robot control; Space technology; Stability; Reinforcement learning; continuous action space; crawling; jumping; motion learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robot and Human Interactive Communication, 2009. RO-MAN 2009. The 18th IEEE International Symposium on
Conference_Location :
Toyama
ISSN :
1944-9445
Print_ISBN :
978-1-4244-5081-7
Electronic_ISBN :
1944-9445
Type :
conf
DOI :
10.1109/ROMAN.2009.5326234
Filename :
5326234
Link To Document :
بازگشت