DocumentCode :
2121054
Title :
Acquisition of coordinated behavior by modular Q-learning agents
Author :
ONO, Norihiko ; Ikeda, Osamu ; Fukumoto, Kenji
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Volume :
3
fYear :
1996
fDate :
4-8 Nov 1996
Firstpage :
1525
Abstract :
Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself. Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior
Keywords :
cooperative systems; game theory; learning (artificial intelligence); state-space methods; combinatorial explosion; coordinated behavior acquisition; functionality specialization; herding behavior; modular Q-learning hunter agents; monolithic reinforcement-learning agents; multi-agent learning problems; randomly-fleeing prey agent; state spaces; Artificial intelligence; Explosions; Intelligent agent; Intelligent systems; Orbital robotics; Organisms; Pursuit algorithms; Robot kinematics; State-space methods; Systems engineering and theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on
Conference_Location :
Osaka
Print_ISBN :
0-7803-3213-X
Type :
conf
DOI :
10.1109/IROS.1996.569015
Filename :
569015
Link To Document :
بازگشت