مرکز منطقه ای اطلاع رساني علوم و فناوري - The measurement of strategy convergence for reinforcement learning in discrete state space

DocumentCode :

567007

Title :

The measurement of strategy convergence for reinforcement learning in discrete state space

Author :

Gao, Yanming ; Yin, Jie ; Wang, Bo ; Qu, Peng ; Zhou, Ling

Author_Institution :

Shandong Provincial Key Laboratory of Marine Ecology and Environment & Disaster Prevention and Mitigation, North China Sea Branch of The State Oceanic Administration, Qingdao, Shandong, China

Volume :

fYear :

2012

fDate :

25-27 May 2012

Firstpage :

213

Lastpage :

219

Abstract :

The concept of entropy is introduced into reinforcement learning. The definitions of the local strategy entropy and global strategy entropy are proposed. The global strategy entropy is proved to be the quantitative problem-independent measurement of the learning progress, i.e. the convergence degree of the strategy. To improve the learning performance, reinforcement learning with self-adaptive learning rate is proposed based on the strategy entropy. The experimental results show that learning based on the local strategy entropy has better learning performance than those with fixed learning rates.

Keywords :

convergence; learning rate; reinforcement learning; strategy entropy;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Science and Automation Engineering (CSAE), 2012 IEEE International Conference on

Conference_Location :

Zhangjiajie, China

Print_ISBN :

978-1-4673-0088-9

Type :

conf

DOI :

10.1109/CSAE.2012.6272761

Filename :

6272761

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=567007