• DocumentCode
    2565205
  • Title

    MP-Draughts: A multiagent reinforcement learning system based on MLP and Kohonen-SOM neural networks

  • Author

    Duarte, Valquiria Aparecida Rosa ; Julia, Rita Maria Silva ; Barcelos, Ayres Roberto Araujo ; Otsuka, Alana Bueno

  • Author_Institution
    Comput. Sci. Dept., Fed. Univ. of Uberlandia - UFU, Uberlandia, Brazil
  • fYear
    2009
  • fDate
    11-14 Oct. 2009
  • Firstpage
    2270
  • Lastpage
    2275
  • Abstract
    This paper presents MP-Draughts (MultiPhase-Draughts): a multiagent environment for Draughts, where one agent - named IIGA- is built and trained such as to be specialized for the initial and the intermediate phases of the games and the remaining ones for the final phases of them. Each agent of MP-Draughts is a neural network which learns almost without human supervision (distinctly from the world champion agent Chinook). MP-Draughts issues from a continuous activity of research whose previous product was the efficient agent VisionDraughts. Despite its good general performance, VisionDraughts frequently does not succeed in final phases of a game, even being in advantageous situation compared to its opponent (for instance, getting into endgame loops). In order to try to reduce this misbehavior of the agent during endgames, MP-Draughts counts on 25 agents specialized for endgame phases, each one trained such as to be able to deal with a determined cluster of endgame board-states. These 25 clusters are mined by a Kohonen Network from a Data Base containing a large quantity of endgame board-states. After trained, MP-Draughts operates in the following way: first, VisionDraughts is used as IIGA; next, the endgame agent that represents the cluster which better fits the current endgame board-state will replace it up to the end of the game. This paper shows that such a strategy significantly improves the general performance of the player agents.
  • Keywords
    multi-agent systems; multilayer perceptrons; Kohonen-SOM neural network; MP-Draughts; VisionDraughts; continuous activity; endgame agent; endgame board-state; human supervision; multiagent environment; multiagent reinforcement learning system; multilayer perceptron; Computer science; Cybernetics; Humans; Interference; Machine learning; Neural networks; Power generation; Self organizing feature maps; US Department of Transportation; USA Councils; Cluste-ringKohonen Self-Organizing Maps; Draughts; Games; Machine Learning; Neural Networks; Reinforcement Learning; Temporal Difference;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-2793-2
  • Electronic_ISBN
    1062-922X
  • Type

    conf

  • DOI
    10.1109/ICSMC.2009.5345960
  • Filename
    5345960