• DocumentCode
    3144480
  • Title

    Analysis and Design of an Improved R-learning

  • Author

    Chen, Wei ; Zhai, Zhenkun ; Li, Xiong ; Guo, Jing

  • Author_Institution
    Fac. of Autom., Guangdong Univ. of Technol., Guangzhou, China
  • fYear
    2009
  • fDate
    1-3 June 2009
  • Firstpage
    48
  • Lastpage
    52
  • Abstract
    This paper presents a modified R-learning according to the traditional average reward reinforcement learning algorithm. Reinforcement learning problems constitute an important class of learning and control problems faced by artificial intelligence systems. The general framework of reinforcement learning can be divided into two forms, discounted reward reinforcement learning and average reward reinforcement learning. R-learning is a model-free average reward reinforcement learning algorithm. Comparing with the conventional R-learning algorithm, this paper undertakes a detailed examination of the improvement of the R-learning, by adding the directing reward function with punitive mechanism and the exploration strategy based on the roulette technique. As the result of this design, agent can gain more information in every learning step. Through applying the improved R-learning to Robocup simulation league (2D) and making comparison with the Q-learning, empirical results show that the learning efficiency has been increased.
  • Keywords
    Markov processes; decision theory; learning (artificial intelligence); mobile robots; multi-robot systems; Markov decision process; Q-learning; R-learning; Robocup simulation league; artificial intelligence system; discounted reward reinforcement learning; exploration strategy; model-free average reward reinforcement learning algorithm; punitive mechanism; roulette technique; Algorithm design and analysis; Artificial intelligence; Automatic control; Control systems; Design automation; Information analysis; Information science; Learning; Markov processes; Paper technology; Convergence; Markov Decision Process; R-learning; Reinforcement Learning; Reward Function;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science, 2009. ICIS 2009. Eighth IEEE/ACIS International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-0-7695-3641-5
  • Type

    conf

  • DOI
    10.1109/ICIS.2009.188
  • Filename
    5223131