Title :
A novel heuristic Q-learning algorithm for solving stochastic games
Author :
Li, Jianwei ; Liu, Weiyi
Author_Institution :
Dept. of Comput. Sci. & Eng., Yunnan Univ., Kunming
Abstract :
We solve Nash equilibrium of stochastic games using heuristic Q-learning method based on ldquoheuristic learningrdquo + ldquo Q-learningrdquo under the framework of noncooperative general-sum games. Determining whether a strategy Nash equilibrium exists in a stochastic game is NP-hard even if the game is finite. Therefore normal Q-learning method based on iterative learning canpsilat solve stochastic games with larger scale. We attempt to make heuristic evaluations for the rewards of each stage game encountered during learning and improve continually the relevant heuristic Q-values in order to approach the optimal learning. Based on such thought, we proposed Multi-agent Heuristic Q-Learning(MHQL)method and proved that its correctness, convergence and acceptable solving time complexity. The experimentation shows that our method can drastically decrease inefficient and repetitive learning thus speed up convergence than iterative Q-learning. Our method can be regarded as a basic framework for general heuristic Q-learning to design better heuristic learning rules.
Keywords :
computational complexity; learning (artificial intelligence); mathematics computing; stochastic games; NP-hard; Nash equilibrium; multi-agent heuristic Q-learning method; noncooperative general-sum games; stochastic games; time complexity; Convergence; Cybernetics; Decision making; Game theory; Heuristic algorithms; Iterative algorithms; Iterative methods; Learning; Nash equilibrium; Stochastic processes;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4633942