DocumentCode
567007
Title
The measurement of strategy convergence for reinforcement learning in discrete state space
Author
Gao, Yanming ; Yin, Jie ; Wang, Bo ; Qu, Peng ; Zhou, Ling
Author_Institution
Shandong Provincial Key Laboratory of Marine Ecology and Environment & Disaster Prevention and Mitigation, North China Sea Branch of The State Oceanic Administration, Qingdao, Shandong, China
Volume
2
fYear
2012
fDate
25-27 May 2012
Firstpage
213
Lastpage
219
Abstract
The concept of entropy is introduced into reinforcement learning. The definitions of the local strategy entropy and global strategy entropy are proposed. The global strategy entropy is proved to be the quantitative problem-independent measurement of the learning progress, i.e. the convergence degree of the strategy. To improve the learning performance, reinforcement learning with self-adaptive learning rate is proposed based on the strategy entropy. The experimental results show that learning based on the local strategy entropy has better learning performance than those with fixed learning rates.
Keywords
convergence; learning rate; reinforcement learning; strategy entropy;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on
Conference_Location
Zhangjiajie, China
Print_ISBN
978-1-4673-0088-9
Type
conf
DOI
10.1109/CSAE.2012.6272761
Filename
6272761
Link To Document