• DocumentCode
    453874
  • Title

    Option Discovery in Reinforcement Learning using Frequent Common Subsequences of Actions

  • Author

    Girgin, Sertan ; Polat, Faruk

  • Author_Institution
    Dept. of Comput. Eng., Middle East Tech. Univ., Ankara
  • Volume
    1
  • fYear
    2005
  • fDate
    28-30 Nov. 2005
  • Firstpage
    371
  • Lastpage
    376
  • Abstract
    Temporally abstract actions, or options, facilitate learning in large and complex domains by exploiting sub-tasks and hierarchical structure of the problem formed by these sub-tasks. In this paper, we study automatic generation of options using common sub-sequences derived from the state transition histories collected as learning progresses. The standard Q-learning algorithm is extended to use generated options transparently, and effectiveness of the method is demonstrated in Dietterich´s Taxi domain
  • Keywords
    Markov processes; learning (artificial intelligence); set theory; Dietterich Taxi domain; frequent common subsequence; option discovery; reinforcement learning; standard Q-learning algorithm; temporally abstract action; Acceleration; Clustering algorithms; Computational intelligence; Computational modeling; History; Joining processes; Learning; Partitioning algorithms; State-space methods; Topology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence for Modelling, Control and Automation, 2005 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
  • Conference_Location
    Vienna
  • Print_ISBN
    0-7695-2504-0
  • Type

    conf

  • DOI
    10.1109/CIMCA.2005.1631294
  • Filename
    1631294