• DocumentCode
    3095791
  • Title

    Dynamic correlation matrix based multi-Q learning for a multi-robot system

  • Author

    Guo, Hongliang ; Meng, Yan

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Stevens Inst. of Technol., Hoboken, NJ
  • fYear
    2008
  • fDate
    22-26 Sept. 2008
  • Firstpage
    840
  • Lastpage
    845
  • Abstract
    Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selections, and difficulty in merging learned experiences from other robots. In this paper, we propose a dynamic correlation matrix based multi-Q learning (DCM-MultiQ) method for a distributed multi-robot system. A novel dynamic correlation matrix is proposed, which not only handles each agentpsilas Q value, but also deals with the correlation among agents. Furthermore, a theoretical proof of the convergence of the proposed DCM-MultiQ algorithm is also provided using a feedback matrix control theory. To evaluate the efficiency of the proposed DCM-MultiQ method, several case studies of a multi-robot system in forage tasks have been conducted. The simulation results show the efficiency and convergence of the proposed method.
  • Keywords
    correlation methods; feedback; learning (artificial intelligence); multi-robot systems; distributed multirobot system; dynamic correlation matrix; feedback matrix control theory; multiQ learning; reinforcement learning; Algorithm design and analysis; Artificial neural networks; Convergence; Correlation; Equations; Learning; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
  • Conference_Location
    Nice
  • Print_ISBN
    978-1-4244-2057-5
  • Type

    conf

  • DOI
    10.1109/IROS.2008.4651021
  • Filename
    4651021