DocumentCode
3215630
Title
A comparative study of policies in Q-learning for foraging tasks
Author
Mohan, Yogeswaran ; Ponnambalam, S.G. ; Inayat-Hussain, Jawaid I.
Author_Institution
Sch. of Eng., Monash Univ., Petaling Jaya, Malaysia
fYear
2009
fDate
9-11 Dec. 2009
Firstpage
134
Lastpage
139
Abstract
Q-learning is a machine learning technique that learns what to do and how to map states to actions to maximize rewards. Q-learning has been applied to various tasks such as foraging, soccer and prey-pursuing robots. In this paper, a simple foraging task has been considered to study the influences of the policies reported in the open literatures. A mobile robot is used to search and retrieve pucks back to a home location. The goal of this study is to identify an efficient policy for q-learning which maximizes the number of pucks collected and minimizes the number of collisions in the environment. Policies namely greedy, epsilon-greedy, Boltzmann distribution and random search are used to study their performances in the foraging task and the results are presented.
Keywords
learning (artificial intelligence); mobile robots; random processes; search problems; Boltzmann distribution; Q-learning; epsilon-greedy policy; foraging task; machine learning; mobile robot; random search; Boltzmann distribution; Convergence; Machine learning; Mechanical engineering; Mobile robots; Scattering; Testing; exploration-exploitation; foraging; machine learning; mobile-robot; policy; q-learning; reinforcement learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Nature & Biologically Inspired Computing, 2009. NaBIC 2009. World Congress on
Conference_Location
Coimbatore
Print_ISBN
978-1-4244-5053-4
Type
conf
DOI
10.1109/NABIC.2009.5393616
Filename
5393616
Link To Document