DocumentCode :
2110575
Title :
Knowledge-Based Exploration for Reinforcement Learning in Self-Organizing Neural Networks
Author :
Teck-Hou Teng ; Ah-Hwee Tan
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
2
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
332
Lastpage :
339
Abstract :
Exploration is necessary during reinforcement learning to discover new solutions in a given problem space. Most reinforcement learning systems, however, adopt a simple strategy: randomly selecting an action from among all the available actions. This paper proposes a novel exploration strategy, known as Knowledge-based Exploration, for guiding the exploration of a family of self-organizing neural networks in reinforcement learning. Specifically, exploration is directed towards unexplored and favorable action choices while steering away from negative action choices that are likely to fail. This is achieved by using the learned knowledge of the agent to identify prior action choices that led to low Q-values in similar situations. Consequently, the agent is expected to learn the right solutions in a shorter time, improving overall learning efficiency. Using a Pursuit-Evasion problem domain, we evaluate the efficacy of the knowledge-based exploration strategy in terms of task performance, rate of learning, and model complexity. Comparisons with random exploration and three other heuristic-based directed exploration strategies show that Knowledge-based Exploration is significantly more effective and robust for reinforcement learning in real time.
Keywords :
learning (artificial intelligence); multi-agent systems; self-organising feature maps; action choices; agent learning; heuristic-based directed exploration strategy; knowledge-based exploration; learning efficiency; learning rate; model complexity; pursuit-evasion problem domain; random action selection; random exploration; reinforcement learning system; self-organizing neural network; solution learning; task performance; Directed Exploration; Reinforcement Learning; Rule-Based System; Self-Organizing Neural Network;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.154
Filename :
6511590