• DocumentCode
    2230286
  • Title

    Sparse Sampling Action Values Initialized by a Compact Representation Technique

  • Author

    Alves, Celeny F. ; Colombini, Esther L. ; Ribeiro, Carlos H C

  • Author_Institution
    Inst. of Aeronaut., Sao Jose dos Campos
  • fYear
    2007
  • fDate
    20-24 Oct. 2007
  • Firstpage
    729
  • Lastpage
    734
  • Abstract
    Most of the techniques proposed for problems involving mobile robots are specified in terms of optimal control of Markov decision processes (MDPs). However, the state space dimension explosion makes such tabular MDP-based solutions unfeasible. As an alternative to this, a planning technique based on sparse sampling (SSA) of simulated instances of a MDP model has been suggested. Because the execution time of this algorithm is exponential on the level of an exploration tree and on the number of samplings to be generated, this paper proposes a technique where leaves null-values in the SSA algorithm are substitute by meaningful values, acquired from any of the following approaches: 1) a simple environment reward distribution; 2) a standard reinforcement learning algorithm, and 3) a compact representation on a coarse state discretization for generating initial estimates of the action values. The experiments carried out showed that such information-based variants of SSA lead quickly to better results than the original technique.
  • Keywords
    Markov processes; learning (artificial intelligence); mobile robots; optimal control; trees (mathematics); Markov decision processes; coarse state discretization; compact representation technique; environment reward distribution; exploration tree level; mobile robots; optimal control; sparse sampling action values; standard reinforcement learning algorithm; Convergence; Explosions; Intelligent robots; Intelligent systems; Learning; Mobile robots; Navigation; Optimal control; Sampling methods; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems Design and Applications, 2007. ISDA 2007. Seventh International Conference on
  • Conference_Location
    Rio de Janeiro
  • Print_ISBN
    978-0-7695-2976-9
  • Type

    conf

  • DOI
    10.1109/ISDA.2007.142
  • Filename
    4389694