Title :
Automatic construct of options in RL
Author :
Ming-Liang, Xu ; Jun, Sun ; Wen-bo, Xu
Author_Institution :
Sch. of Inf. Technol., Jiangnan Univ., Wuxi, China
Abstract :
A taboo state is introduced into the environment to discover sub-goals. The agent samples trajectories from the start state to the goal state, which pass through different bottlenecks. Different tasks are then submitted to the agent, and the bottlenecks are identified according to whether each task is accomplished. Appropriate bottlenecks are selected as the sub-goals of the options to be constructed, based on the adjacency relationships among them. At the same time, the agent obtains the initiation sets and the policies of the options. Grid-world tasks illustrate that, with the proposed method, the agent can automatically construct useful options online, and that these options can accelerate learning and transfer knowledge among similar learning tasks.
Keywords :
decision making; grid computing; learning (artificial intelligence); multi-agent systems; grid-world tasks; knowledge transference; learning tasks; machine learning framework; reinforcement learning; sequential decision making problems; taboo state; Accelerated aging; Algorithm design and analysis; Artificial intelligence; Automatic control; Decision making; Frequency; Information technology; Machine learning; Sun; Testing; Q-learning; hierarchical reinforcement learning; option; subgoal; taboo search;
Conference_Title :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
DOI :
10.1109/ICICISYS.2009.5357937