• DocumentCode
    1874000
  • Title

    Evolution versus Temporal Difference Learning for learning to play Ms. Pac-Man

  • Author

    Burrow, Peter ; Lucas, Simon M.

  • Author_Institution
    Sch. of Comput. Sci. & Electron. Eng., Univ. of Essex, Colchester, UK
  • fYear
    2009
  • fDate
    7-10 Sept. 2009
  • Firstpage
    53
  • Lastpage
    60
  • Abstract
    This paper investigates various factors that affect the ability of a system to learn to play Ms. Pac-Man. For this study Ms. Pac-Man provides a game of appropriate complexity, and has the advantage that there have been many other papers published on systems that learn to play this game. The results indicate that temporal difference learning (TDL) performs most reliably with a tabular function approximator, and that the reward structure chosen can have a dramatic impact on performance. When using a multi-layer perceptron as a function approximator, evolution outperforms TDL by a significant margin. Overall, the best results were obtained by evolving multi-layer perceptrons.
  • Keywords
    computer games; function approximation; multilayer perceptrons; temporal reasoning; Ms. Pac-Man; game; multilayer perceptron; reward structure; tabular function approximator; temporal difference learning; Artificial neural networks; Evolution (biology); Evolutionary computation; Genetic algorithms; Genetic programming; Multilayer perceptrons; Parallel programming; State estimation; Testing; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Games, 2009. CIG 2009. IEEE Symposium on
  • Conference_Location
    Milano
  • Print_ISBN
    978-1-4244-4814-2
  • Electronic_ISBN
    978-1-4244-4815-9
  • Type

    conf

  • DOI
    10.1109/CIG.2009.5286495
  • Filename
    5286495