• DocumentCode
    2838747
  • Title

    Research on improvement of model-free average reward reinforcement learning and its simulation experiment

  • Author

    Chen, Wei ; Zhai, Zhenkun ; Li, Xiong ; Guo, Jing ; Wang, Jie

  • Author_Institution
    Fac. of Autom., Guangdong Univ. of Technol., Guangzhou, China
  • fYear
    2009
  • fDate
    17-19 June 2009
  • Firstpage
    4933
  • Lastpage
    4936
  • Abstract
    Traditional reinforcement learning always emphasizes the independent learning of a single agent. In Multi-Agent System (MAS), considering the relationship between independent learning and group learning, this paper presents a hybrid algorithm based on average reward reinforcement learning. In learning process of the modified algorithm, it still pays attention to the independent learning. In order to select an action which can reflect the multi-agent environmental information, we add the observed information and the prediction of other agent´s actions when the learning agent chooses his action according to the current environmental state. The advantage of this design is that not only the agent will learn the optimal policy through autonomous study, but also as one member of MAS, the learning process can be integrated into the whole multi-agent environment. Robocup simulation league (2D) is a typical multi-agent system. By applying the new method to the training of the player, we prove the feasibility and validity of this algorithm.
  • Keywords
    control engineering computing; learning (artificial intelligence); multi-agent systems; hybrid algorithm; multi-agent system; reinforcement learning; robocup simulation league; Artificial intelligence; Automation; Autonomous agents; Learning; Multiagent systems; Robots; State-space methods; Stochastic systems; Multi-agent system; R-learning; Reinforcement learning; Robocup;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control and Decision Conference, 2009. CCDC '09. Chinese
  • Conference_Location
    Guilin
  • Print_ISBN
    978-1-4244-2722-2
  • Electronic_ISBN
    978-1-4244-2723-9
  • Type

    conf

  • DOI
    10.1109/CCDC.2009.5194915
  • Filename
    5194915