DocumentCode :
1942944
Title :
The Q(λ) algorithm based on heuristic reward function
Author :
Zhang, Jianhong ; Shi, Ying ; Xie, Xiaofei
Author_Institution :
Sch. of Inf. & Eng., Huzhou Teachers´´ Coll., Huzhou, China
fYear :
2010
fDate :
13-15 Aug. 2010
Firstpage :
139
Lastpage :
142
Abstract :
For reinforcement learning often show slow convergence speed problem in continuous and complex tasks, this paper proposes a Q(λ) algorithm based on heuristic reward function-Q(λ)-HRF algorithm. This algorithm can extract features from the environment and get the heuristic information, which can be applied to the study by Agent in the form of reward function, which can accelerate the convergence speed significantly. We also proved the convergence of the algorithm by mathematical way, and applied the algorithm to the Maze platform, the experimental results show that: the Q(λ)-HRF algorithm has better convergence speed than Q(λ) algorithm.
Keywords :
learning (artificial intelligence); HRF algorithm; Maze platform; Q(λ) algorithm; complex task; continuous task; convergence speed problem; feature extraction; heuristic reward function; reinforcement learning; Algorithm design and analysis; Convergence; Feature extraction; Heuristic algorithms; Learning; Machine learning algorithms; Markov processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Information Processing (ICICIP), 2010 International Conference on
Conference_Location :
Dalian
Print_ISBN :
978-1-4244-7047-1
Type :
conf
DOI :
10.1109/ICICIP.2010.5564220
Filename :
5564220
Link To Document :
بازگشت