DocumentCode :
2294549
Title :
A reinforcement learning scheme for a multi-agent card game
Author :
Fujita, Hajime ; Matsuno, Yoichiro ; Ishii, Shin
Author_Institution :
Nara Inst. of Sci. & Technol., Japan
Volume :
5
fYear :
2003
fDate :
5-8 Oct. 2003
Firstpage :
4071
Abstract :
We formulate an automatic strategy acquisition problem for the multi-agent card game "hearts" as a reinforcement learning (RL) problem. Since there are often a lot of unobservable cards in this game, RL is approximately dealt with in the framework of a partially observable Markov decision process (POMDP). This article presents a POMDP-RL method based on estimation of unobservable state variables and prediction of actions of the opponent agents. Simulation results show our model-based POMDP-RL method is applicable to a realistic multi-agent problem.
Keywords :
Markov processes; computer games; decision making; learning (artificial intelligence); multi-agent systems; action control; actor-critic algorithm; automatic strategy acquisition problem; hearts card game; multiagent card game; partially observable Markov decision process; reinforcement learning scheme; state variables; Accelerated aging; Clocks; Heart; Learning; State estimation; State-space methods; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-7952-7
Type :
conf
DOI :
10.1109/ICSMC.2003.1245625
Filename :
1245625
Link To Document :
بازگشت