Title of article :
Programming backgammon using self-teaching neural nets Original Research Article
Author/Authors :
Gerald Tesauro، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2002
Pages :
19
From page :
181
To page :
199
Abstract :
TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD-Gammonʹs self-teaching methodology results in a surprisingly strong program: without lookahead, its positional judgement rivals that of human experts, and when combined with shallow lookahead, it reaches a level of play that surpasses even the best human players. The success of TD-Gammon has also been replicated by several other programmers; at least two other neural net programs also appear to be capable of superhuman play. Previous papers on TD-Gammon have focused on developing a scientific understanding of its reinforcement learning methodology. This paper views machine learning as a tool in a programmerʹs toolkit, and considers how it can be combined with other programming techniques to achieve and surpass world-class backgammon play. Particular emphasis is placed on programming shallow-depth search algorithms, and on TD-Gammonʹs doubling algorithm, which is described in print here for the first time.
Keywords :
Rollouts , Doubling strategy , Reinforcement learning , Temporal difference learning , Neural networks , Games , Backgammon
Journal title :
Artificial Intelligence
Serial Year :
2002
Journal title :
Artificial Intelligence
Record number :
1207088
Link To Document :
بازگشت