مرکز منطقه ای اطلاع رساني علوم و فناوري - Acquisition of coordinated behavior by modular Q-learning agents

DocumentCode :

2121054

Title :

Acquisition of coordinated behavior by modular Q-learning agents

Author :

ONO, Norihiko ; Ikeda, Osamu ; Fukumoto, Kenji

Author_Institution :

Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan

Volume :

fYear :

1996

fDate :

4-8 Nov 1996

Firstpage :

1525

Abstract :

Recent attempts to let monolithic reinforcement-learning agents synthesize coordinated behavior scale poorly to more complicated multi-agent learning problems where multiple learning agents play different roles and work together for the accomplishment of their common goal. These learning agents have to receive and respond to various sensory information from their partners as well as that from the physical environment itself. Hence, their state spaces are subject to grow exponentially in the number of the partners. As an illustrative problem suffered from this kind of combinatorial explosion, we consider a modified version of the pursuit problem, and show how successfully a collection of modular Q-learning hunter agents synthesize coordinated decision policies needed to capture a randomly-fleeing prey agent effectively, by specializing their functionality and acquiring herding behavior

Keywords :

cooperative systems; game theory; learning (artificial intelligence); state-space methods; combinatorial explosion; coordinated behavior acquisition; functionality specialization; herding behavior; modular Q-learning hunter agents; monolithic reinforcement-learning agents; multi-agent learning problems; randomly-fleeing prey agent; state spaces; Artificial intelligence; Explosions; Intelligent agent; Intelligent systems; Orbital robotics; Organisms; Pursuit algorithms; Robot kinematics; State-space methods; Systems engineering and theory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on

Conference_Location :

Osaka

Print_ISBN :

0-7803-3213-X

Type :

conf

DOI :

10.1109/IROS.1996.569015

Filename :

569015

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2121054