DocumentCode :
1277054
Title :
Neural-Fitted TD-Leaf Learning for Playing Othello With Structured Neural Networks
Author :
van den Dries, S.; Wiering, Marco A.
Author_Institution :
Fac. of Mech. Eng., Eindhoven Univ. of Technol., Eindhoven, Netherlands
Volume :
23
Issue :
11
fYear :
2012
Firstpage :
1701
Lastpage :
1713
Abstract :
This paper describes a methodology for quickly learning to play games at a strong level. The methodology consists of a novel combination of three techniques, and a variety of experiments on the game of Othello demonstrates their usefulness. First, structures or topologies in neural network connectivity patterns are used to decrease the number of learning parameters and to deal more effectively with the structural credit assignment problem, which is to change individual network weights based on the obtained feedback. Furthermore, the structured neural networks are trained with the novel neural-fitted temporal difference (TD) learning algorithm to create a system that can exploit most of the training experiences and enhance learning speed and performance. Finally, we use the neural-fitted TD-leaf algorithm to learn more effectively when look-ahead search is performed by the game-playing program. Our extensive experimental study clearly indicates that the proposed method outperforms linear networks, fully connected neural networks, and evaluation functions evolved with evolutionary algorithms.
Keywords :
computer games; evolutionary computation; learning (artificial intelligence); Othello playing; TD learning algorithm; evolutionary algorithms; game-playing program; individual network weights; look-ahead search; neural network connectivity patterns; neural-fitted TD-leaf algorithm; neural-fitted temporal difference learning algorithm; structural credit assignment problem; structured neural network training; Color; Games; Learning; Neural networks; Sociology; Statistics; Training; Evolutionary algorithms (EAs); Othello; reinforcement learning; structured neural networks; temporal difference (TD) learning;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
IEEE
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2012.2210559
Filename :
6291798