Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions

Author

Girgin, Sertan ; Polat, Faruk

Author_Institution

Dept. of Comput. Eng., Middle East Tech. Univ., Ankara

Volume

1

fYear

2005

fDate

28-30 Nov. 2005

Firstpage

371

Lastpage

376

Abstract

Temporally abstract actions, or options, facilitate learning in large and complex domains by exploiting sub-tasks and hierarchical structure of the problem formed by these sub-tasks. In this paper, we study automatic generation of options using common sub-sequences derived from the state transition histories collected as learning progresses. The standard Q-learning algorithm is extended to use generated options transparently, and effectiveness of the method is demonstrated in Dietterich´s Taxi domain

Keywords

Markov processes; learning (artificial intelligence); set theory; Dietterich Taxi domain; frequent common subsequence; option discovery; reinforcement learning; standard Q-learning algorithm; temporally abstract action; Acceleration; Clustering algorithms; Computational intelligence; Computational modeling; History; Joining processes; Learning; Partitioning algorithms; State-space methods; Topology;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on

Conference_Location

Vienna

Print_ISBN

0-7695-2504-0

Type

conf

DOI

10.1109/CIMCA.2005.1631294

Filename

1631294