Title :
Acquisition of coordinated behavior by modular Q-learning agents
Author :
ONO, Norihiko ; Ikeda, Osamu ; Fukumoto, Kenji
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Abstract :
Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself. Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior
Keywords :
cooperative systems; game theory; learning (artificial intelligence); state-space methods; combinatorial explosion; coordinated behavior acquisition; functionality specialization; herding behavior; modular Q-learning hunter agents; monolithic reinforcement-learning agents; multi-agent learning problems; randomly-fleeing prey agent; state spaces; Artificial intelligence; Explosions; Intelligent agent; Intelligent systems; Orbital robotics; Organisms; Pursuit algorithms; Robot kinematics; State-space methods; Systems engineering and theory;
Conference_Titel :
Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on
Conference_Location :
Osaka
Print_ISBN :
0-7803-3213-X
DOI :
10.1109/IROS.1996.569015