Sparse Sampling Action Values Initialized by a Compact Representation Technique

Author

Alves, Celeny F. ; Colombini, Esther L. ; Ribeiro, Carlos H C

Author_Institution

Inst. of Aeronaut., Sao Jose dos Campos

fYear

2007

fDate

20-24 Oct. 2007

Firstpage

729

Lastpage

734

Abstract

Most of the techniques proposed for problems involving mobile robots are specified in terms of optimal control of Markov decision processes (MDPs). However, the state space dimension explosion makes such tabular MDP-based solutions unfeasible. As an alternative to this, a planning technique based on sparse sampling (SSA) of simulated instances of a MDP model has been suggested. Because the execution time of this algorithm is exponential on the level of an exploration tree and on the number of samplings to be generated, this paper proposes a technique where leaves null-values in the SSA algorithm are substitute by meaningful values, acquired from any of the following approaches: 1) a simple environment reward distribution; 2) a standard reinforcement learning algorithm, and 3) a compact representation on a coarse state discretization for generating initial estimates of the action values. The experiments carried out showed that such information-based variants of SSA lead quickly to better results than the original technique.

Keywords

Markov processes; learning (artificial intelligence); mobile robots; optimal control; trees (mathematics); Markov decision processes; coarse state discretization; compact representation technique; environment reward distribution; exploration tree level; mobile robots; optimal control; sparse sampling action values; standard reinforcement learning algorithm; Convergence; Explosions; Intelligent robots; Intelligent systems; Learning; Mobile robots; Navigation; Optimal control; Sampling methods; State-space methods;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on

Conference_Location

Rio de Janeiro

Print_ISBN

978-0-7695-2976-9

Type

conf

DOI

10.1109/ISDA.2007.142

Filename

4389694