Title :
Utility of Turning Spot Learning under complex goal search and the limit of memory usage
Author :
Tezuka, Yuki ; Notsu, Akira ; Honda, Katsuhiro
Author_Institution :
Dept. of Comput. Sci. & Intell. Syst., Osaka Prefecture Univ., Sakai, Japan
Abstract :
Chain Form Reinforcement Learning (CFRL) was proposed for a reinforcement learning agent using low memory. However, we hold unused information in the memory. In this paper, we introduce Turning Spot Learning (TSL). The method allows an agent to learn with less memory than a CFRL agent. TSL is a method which imitates human perceptions. If we are asked direction, we often tell spot where we changes our action. We call it "Turning Spot". It retains information regarding state, action and distance of a present spot to a next spot. A TSL agent learns only Turning Spots and uses our original action selection method using nearest neighbor algorithm. And, we attempted to limit the amount of the memory usage that a TSL usage can use. Our method was made a comparison to Q-Learning and CFRL in two kinds of goal search problems. We examined performance and discussed the best usage environment.
Keywords :
learning (artificial intelligence); search problems; CFRL agent; Q-learning; TSL agent; chain form reinforcement learning; complex goal search; goal search problems; human perception imitation; memory usage limit; nearest neighbor algorithm; original action selection method; turning spot learning; Games; Learning (artificial intelligence); Memory management; Reliability; Search problems; Standards; Turning; Learning for a low memory agent; Reinforcement learning; State-action set categorization;
Conference_Titel :
Soft Computing and Intelligent Systems (SCIS), 2014 Joint 7th International Conference on and Advanced Intelligent Systems (ISIS), 15th International Symposium on
DOI :
10.1109/SCIS-ISIS.2014.7044656