Title :
Performance analysis of a new updating rule for TD(λ) learning in feedforward networks for position evaluation in Go game
Author :
Chan, Horace Wai-kit ; King, Irwin ; Lui, John C S
Author_Institution :
Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Abstract :
In this paper, a new updating rule for applying temporal difference (TD) learning to multilayer feedforward networks is derived. Networks are trained to evaluate Go board positions by TD(λ) learning with different values of λ. Performance of each network is estimated by letting it play against other networks. Results show that nonzero λ gives better learning for the network and statistically, larger λ gives better performance
Keywords :
feedforward neural nets; games of skill; learning (artificial intelligence); multilayer perceptrons; temporal reasoning; Go game; TD(λ) learning; multilayer feedforward networks; neural nets; performance analysis; position evaluation; temporal difference learning; updating rule; Backpropagation; Computer science; Delay; Design engineering; Humans; Intelligent networks; Knowledge engineering; Neural networks; Performance analysis; Supervised learning;
Conference_Titel :
Neural Networks, 1996., IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
0-7803-3210-5
DOI :
10.1109/ICNN.1996.549159