• DocumentCode
    3563645
  • Title

    Utility of Turning Spot Learning under complex goal search and the limit of memory usage

  • Author

    Tezuka, Yuki ; Notsu, Akira ; Honda, Katsuhiro

  • Author_Institution
    Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
  • fYear
    2014
  • Firstpage
    1418
  • Lastpage
    1423
  • Abstract
    Chain Form Reinforcement Learning (CFRL) was proposed for a reinforcement learning agent using low memory. However, we hold unused information in the memory. In this paper, we introduce Turning Spot Learning (TSL). The method allows an agent to learn with less memory than a CFRL agent. TSL is a method which imitates human perceptions. If we are asked direction, we often tell spot where we changes our action. We call it "Turning Spot". It retains information regarding state, action and distance of a present spot to a next spot. A TSL agent learns only Turning Spots and uses our original action selection method using nearest neighbor algorithm. And, we attempted to limit the amount of the memory usage that a TSL usage can use. Our method was made a comparison to Q-Learning and CFRL in two kinds of goal search problems. We examined performance and discussed the best usage environment.
  • Keywords
    learning (artificial intelligence); search problems; CFRL agent; Q-learning; TSL agent; chain form reinforcement learning; complex goal search; goal search problems; human perception imitation; memory usage limit; nearest neighbor algorithm; original action selection method; turning spot learning; Games; Learning (artificial intelligence); Memory management; Reliability; Search problems; Standards; Turning; Learning for a low memory agent; Reinforcement learning; State-action set categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
  • Type

    conf

  • DOI
    10.1109/SCIS-ISIS.2014.7044656
  • Filename
    7044656