DocumentCode
397875
Title
Q-ac: multiagent reinforcement learning with perception-conversion action
Author
Sun, Ruoying ; Tatsum, Shoji ; Zhao, Gang
Author_Institution
Fac. of Eng., Osaka City Univ., Japan
Volume
3
fYear
2003
fDate
5-8 Oct. 2003
Firstpage
2950
Abstract
For the task under Markov Decision Process, this paper presents a novel multiagent Reinforcement Learning (RL) with perception and conversion action mechanism that learning agents observe adversary agent and convert adversarial action to learning agents´ corresponding action as observing state variation incurred by the adversary agent in the task environment during learning processes. Meanwhile, this paper surveys inexpensive communication ways among learning agents utilizing both the direct communication and the indirect media communication to realize agents´ cooperation. The direct communication is realized by sharing sensation; the indirect media communication is realized by updating reinforcement values on the common environment observation. Then, a multiagent RL algorithm, Q-ac multiagent RL method, is proposed. By perception and conversion action, the learning agents extend learning episodes and derive more observation by less action. The direct communication enhances agents´ observation ability to the environment, and the indirect media communication improves agents´ ability deriving the optimal action policy. The simulation results on hunter game demonstrate the efficiency of the proposed method.
Keywords
Markov processes; learning (artificial intelligence); multi-agent systems; Markov decision process; Q-ac multiagent RL method; Q-ac multiagent reinforcement learning method; hunter game; media communication; perception-conversion action mechanism; Control system synthesis; Delay; Design methodology; Learning systems; Stochastic processes; Sun;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN
1062-922X
Print_ISBN
0-7803-7952-7
Type
conf
DOI
10.1109/ICSMC.2003.1244340
Filename
1244340
Link To Document