DocumentCode :
2281275
Title :
An adjustment method of the number of states of Q-learning segmenting state space adaptively
Author :
Hamagami, Tomoki ; Hirata, Hironori
Author_Institution :
Graduate Sch. of Sci. & Technol., Chiba Univ., Japan
Volume :
4
fYear :
2003
fDate :
5-8 Oct. 2003
Firstpage :
3062
Abstract :
This paper presents a method for partitioning a continuous state space so that an agent can behave autonomously. The partitioning technique is derived from QLASS (Q-learning with adaptive state segmentation), a simple and effective technique. QLASS constructs the discrete state space as a Voronoi diagram generated by a finite set of points called generators, so the resulting state space is intuitively easy to understand. However, QLASS tends to generate too many segments during learning, which prevents an agent using it from learning appropriate actions efficiently. To overcome this problem, a method for adjusting the number of states is proposed; it restricts or boosts the partitioning by using the eligibilities and temperature parameter of each segment. Experimental results show that this adjustment method partitions the state space suitably according to not only the characteristics of the environment but also its dynamic changes.
Keywords :
computational geometry; learning (artificial intelligence); mobile robots; state-space methods; Q-learning segmenting state space; Voronoi diagram; adaptive learning; adjustment method; autonomous behavior; discrete state space; environment characteristic; generators; partitioning technique; reinforcement learning; temperature parameter; Frequency; Kernel; Learning; Partitioning algorithms; Shape; Space technology; State estimation; State-space methods; Temperature; Timing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-7952-7
Type :
conf
DOI :
10.1109/ICSMC.2003.1244360
Filename :
1244360