DocumentCode :
2326979
Title :
An Improved Reinforcement Q-Learning Method with BP Neural Networks in Robot Soccer
Author :
Wang, Shi-chao ; Song, Zheng-xi ; Ding, Hao ; Shi, Hao-bin
Author_Institution :
Sch. of Electron. & Inf., Northwestern Polytech. Univ., Xi´´an, China
Volume :
1
fYear :
2011
fDate :
28-30 Oct. 2011
Firstpage :
177
Lastpage :
180
Abstract :
In traditional reinforcement Q-Learning method, there exists two problems: difficulty of dividing the state information, complexity of extreme large dimension input. To solve these two problems, this paper proposed an improved reinforcement Q-Learning method with BP neutral network. In this method, the large Q table is replaced by a BP neural network. Continuous environmental information is the input. The Q value is the output. The Q value and weight of the network are also adjusted by the action rewards. This paper presents an algorithm for single agent´s action selection. Simulation shows proposed method is more stable and applicable for the agent´s strategy selection.
Keywords :
backpropagation; control engineering computing; learning (artificial intelligence); multi-robot systems; neural nets; BP neural networks; Q value; environmental information; reinforcement Q-learning Method; robot soccer; Biological neural networks; Educational institutions; Equations; Learning systems; Neurons; Robots; Training; BP Neural Networks; Reinforcement Q-Learning; Robot Soccer;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Design (ISCID), 2011 Fourth International Symposium on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4577-1085-8
Type :
conf
DOI :
10.1109/ISCID.2011.53
Filename :
6079665
Link To Document :
بازگشت