DocumentCode
2230286
Title
Sparse Sampling Action Values Initialized by a Compact Representation Technique
Author
Alves, Celeny F. ; Colombini, Esther L. ; Ribeiro, Carlos H C
Author_Institution
Inst. of Aeronaut., Sao Jose dos Campos
fYear
2007
fDate
20-24 Oct. 2007
Firstpage
729
Lastpage
734
Abstract
Most of the techniques proposed for problems involving mobile robots are specified in terms of optimal control of Markov decision processes (MDPs). However, the state space dimension explosion makes such tabular MDP-based solutions unfeasible. As an alternative to this, a planning technique based on sparse sampling (SSA) of simulated instances of a MDP model has been suggested. Because the execution time of this algorithm is exponential on the level of an exploration tree and on the number of samplings to be generated, this paper proposes a technique where leaves null-values in the SSA algorithm are substitute by meaningful values, acquired from any of the following approaches: 1) a simple environment reward distribution; 2) a standard reinforcement learning algorithm, and 3) a compact representation on a coarse state discretization for generating initial estimates of the action values. The experiments carried out showed that such information-based variants of SSA lead quickly to better results than the original technique.
Keywords
Markov processes; learning (artificial intelligence); mobile robots; optimal control; trees (mathematics); Markov decision processes; coarse state discretization; compact representation technique; environment reward distribution; exploration tree level; mobile robots; optimal control; sparse sampling action values; standard reinforcement learning algorithm; Convergence; Explosions; Intelligent robots; Intelligent systems; Learning; Mobile robots; Navigation; Optimal control; Sampling methods; State-space methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on
Conference_Location
Rio de Janeiro
Print_ISBN
978-0-7695-2976-9
Type
conf
DOI
10.1109/ISDA.2007.142
Filename
4389694
Link To Document