• DocumentCode
    567007
  • Title

    The measurement of strategy convergence for reinforcement learning in discrete state space

  • Author

    Gao, Yanming ; Yin, Jie ; Wang, Bo ; Qu, Peng ; Zhou, Ling

  • Author_Institution
    Shandong Provincial Key Laboratory of Marine Ecology and Environment & Disaster Prevention and Mitigation, North China Sea Branch of The State Oceanic Administration, Qingdao, Shandong, China
  • Volume
    2
  • fYear
    2012
  • fDate
    25-27 May 2012
  • Firstpage
    213
  • Lastpage
    219
  • Abstract
    The concept of entropy is introduced into reinforcement learning. The definitions of the local strategy entropy and global strategy entropy are proposed. The global strategy entropy is proved to be the quantitative problem-independent measurement of the learning progress, i.e. the convergence degree of the strategy. To improve the learning performance, reinforcement learning with self-adaptive learning rate is proposed based on the strategy entropy. The experimental results show that learning based on the local strategy entropy has better learning performance than those with fixed learning rates.
  • Keywords
    convergence; learning rate; reinforcement learning; strategy entropy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
  • Conference_Location
    Zhangjiajie, China
  • Print_ISBN
    978-1-4673-0088-9
  • Type

    conf

  • DOI
    10.1109/CSAE.2012.6272761
  • Filename
    6272761