DocumentCode :
663526
Title :
Lifelong transfer learning with an option hierarchy
Author :
Hawasly, Majd ; Ramamoorthy, Subramanian
Author_Institution :
Sch. of Inf., Univ. of Edinburgh, Edinburgh, UK
fYear :
2013
fDate :
3-7 Nov. 2013
Firstpage :
1341
Lastpage :
1346
Abstract :
Many applications require autonomous agents to achieve quick responses to task instances drawn from a rich family of qualitatively-related tasks. We address the setting where the tasks share a state-action space and have the same qualitative objective but differ in dynamics. We adopt a transfer learning approach where common structure in previously-learnt policies, in the form of shared subtasks, is exploited to accelerate learning in subsequent ones. We use a probabilistic mixture model to describe regions in state space which are common to successful trajectories in different instances. Then, we extract policy fragments from previously-learnt policies that are specialised to these regions. These policy fragments are options, whose initiation and termination sets are automatically extracted from data by the mixture model. In novel task instances, these options are used in an SMDP learning process and option learning repeats over the resulting policy library. The utility of this method is demonstrated through experiments in a standard navigation environment and then in the RoboCup simulated soccer domain with opponent teams of different skill.
Keywords :
continuing professional development; learning (artificial intelligence); multi-robot systems; probability; RoboCup simulated soccer domain; SMDP learning process; autonomous agents; lifelong transfer learning approach; option hierarchy; option learning; policy fragments; policy library; probabilistic mixture model; state-action space; Abstracts; Acceleration; Kernel; Markov processes; Navigation; Probabilistic logic; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
Conference_Location :
Tokyo
ISSN :
2153-0858
Type :
conf
DOI :
10.1109/IROS.2013.6696523
Filename :
6696523
Link To Document :
بازگشت