مرکز منطقه ای اطلاع رساني علوم و فناوري - A novel heuristic Q-learning algorithm for solving stochastic games

DocumentCode :

2956517

Title :

A novel heuristic Q-learning algorithm for solving stochastic games

Author :

Li, Jianwei ; Liu, Weiyi

Author_Institution :

Dept. of Comput. Sci. & Eng., Yunnan Univ., Kunming

fYear :

2008

fDate :

1-8 June 2008

Firstpage :

1135

Lastpage :

1144

Abstract :

We solve Nash equilibrium of stochastic games using heuristic Q-learning method based on ldquoheuristic learningrdquo + ldquo Q-learningrdquo under the framework of noncooperative general-sum games. Determining whether a strategy Nash equilibrium exists in a stochastic game is NP-hard even if the game is finite. Therefore normal Q-learning method based on iterative learning canpsilat solve stochastic games with larger scale. We attempt to make heuristic evaluations for the rewards of each stage game encountered during learning and improve continually the relevant heuristic Q-values in order to approach the optimal learning. Based on such thought, we proposed Multi-agent Heuristic Q-Learning(MHQL)method and proved that its correctness, convergence and acceptable solving time complexity. The experimentation shows that our method can drastically decrease inefficient and repetitive learning thus speed up convergence than iterative Q-learning. Our method can be regarded as a basic framework for general heuristic Q-learning to design better heuristic learning rules.

Keywords :

computational complexity; learning (artificial intelligence); mathematics computing; stochastic games; NP-hard; Nash equilibrium; multi-agent heuristic Q-learning method; noncooperative general-sum games; stochastic games; time complexity; Convergence; Cybernetics; Decision making; Game theory; Heuristic algorithms; Iterative algorithms; Iterative methods; Learning; Nash equilibrium; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on

Conference_Location :

Hong Kong

ISSN :

1098-7576

Print_ISBN :

978-1-4244-1820-6

Electronic_ISBN :

1098-7576

Type :

conf

DOI :

10.1109/IJCNN.2008.4633942

Filename :

4633942

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2956517