• DocumentCode
    303422
  • Title

    Performance analysis of a new updating rule for TD(λ) learning in feedforward networks for position evaluation in Go game

  • Author

    Chan, Horace Wai-kit ; King, Irwin ; Lui, John C S

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
  • Volume
    3
  • fYear
    1996
  • fDate
    3-6 Jun 1996
  • Firstpage
    1716
  • Abstract
    In this paper, a new updating rule for applying temporal difference (TD) learning to multilayer feedforward networks is derived. Networks are trained to evaluate Go board positions by TD(λ) learning with different values of λ. Performance of each network is estimated by letting it play against other networks. Results show that nonzero λ gives better learning for the network and statistically, larger λ gives better performance
  • Keywords
    feedforward neural nets; games of skill; learning (artificial intelligence); multilayer perceptrons; temporal reasoning; Go game; TD(λ) learning; multilayer feedforward networks; neural nets; performance analysis; position evaluation; temporal difference learning; updating rule; Backpropagation; Computer science; Delay; Design engineering; Humans; Intelligent networks; Knowledge engineering; Neural networks; Performance analysis; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1996., IEEE International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    0-7803-3210-5
  • Type

    conf

  • DOI
    10.1109/ICNN.1996.549159
  • Filename
    549159