DocumentCode :
2230286
Title :
Sparse Sampling Action Values Initialized by a Compact Representation Technique
Author :
Alves, Celeny F. ; Colombini, Esther L. ; Ribeiro, Carlos H C
Author_Institution :
Inst. of Aeronaut., Sao Jose dos Campos
fYear :
2007
fDate :
20-24 Oct. 2007
Firstpage :
729
Lastpage :
734
Abstract :
Most of the techniques proposed for problems involving mobile robots are specified in terms of optimal control of Markov decision processes (MDPs). However, the state space dimension explosion makes such tabular MDP-based solutions unfeasible. As an alternative to this, a planning technique based on sparse sampling (SSA) of simulated instances of a MDP model has been suggested. Because the execution time of this algorithm is exponential on the level of an exploration tree and on the number of samplings to be generated, this paper proposes a technique where leaves null-values in the SSA algorithm are substitute by meaningful values, acquired from any of the following approaches: 1) a simple environment reward distribution; 2) a standard reinforcement learning algorithm, and 3) a compact representation on a coarse state discretization for generating initial estimates of the action values. The experiments carried out showed that such information-based variants of SSA lead quickly to better results than the original technique.
Keywords :
Markov processes; learning (artificial intelligence); mobile robots; optimal control; trees (mathematics); Markov decision processes; coarse state discretization; compact representation technique; environment reward distribution; exploration tree level; mobile robots; optimal control; sparse sampling action values; standard reinforcement learning algorithm; Convergence; Explosions; Intelligent robots; Intelligent systems; Learning; Mobile robots; Navigation; Optimal control; Sampling methods; State-space methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on
Conference_Location :
Rio de Janeiro
Print_ISBN :
978-0-7695-2976-9
Type :
conf
DOI :
10.1109/ISDA.2007.142
Filename :
4389694
Link To Document :
بازگشت