DocumentCode :
2638609
Title :
Implicit estimation of other´s intention without direct observation of actions in a collaborative task: situation-sensitive reinforcement learning
Author :
Taniguchi, Tadahiro ; Ogawa, Kenji ; Sawaragi, Tetsuo
Author_Institution :
Kyoto Univ., Kyoto
fYear :
2007
fDate :
17-20 Sept. 2007
Firstpage :
996
Lastpage :
1003
Abstract :
An agent in a multi-agent environment should adapt to the diversities of dynamics that are caused by changes in the physical properties of the task environment and in social situations concerning how the partner is shifting his/her behaviors to achieve the task. When the partner´s intention changes in the latter, a collaborator agent has to notice this from what is observed in the shared-task environment and to explore how to adaptively collaborate with the partner. A situation-sensitive reinforcement learning (SSRL) architecture is presented in this paper. SSRL enables a collaborator agent to implicitly estimate the partner´s goal. The mathematical basis of the implicit estimates is also addressed. A simple truck-pushing task by a pair of agents is presented as a testbed example, and the simulation results show that organized collaboration could be achieved by an agent embedded with our model in adapting to the partner´s intentional strategic changes.
Keywords :
groupware; learning (artificial intelligence); multi-agent systems; collaborative task; implicit intention estimation; multiagent environment; partner intentional strategy; situation-sensitive reinforcement learning; Autonomous agents; Collaboration; Computational modeling; Computer architecture; Cultural differences; Humans; Informatics; Learning; Robots; Testing; Reinforcement learning; cooperative systems; modular learning; multi-agent systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE, 2007 Annual Conference
Conference_Location :
Takamatsu
Print_ISBN :
978-4-907764-27-2
Electronic_ISBN :
978-4-907764-27-2
Type :
conf
DOI :
10.1109/SICE.2007.4421130
Filename :
4421130
Link To Document :
بازگشت