Title :
A comparative study of policies in Q-learning for foraging tasks
Author :
Mohan, Yogeswaran ; Ponnambalam, S.G. ; Inayat-Hussain, Jawaid I.
Author_Institution :
Sch. of Eng., Monash Univ., Petaling Jaya, Malaysia
Abstract :
Q-learning is a machine learning technique that learns what to do and how to map states to actions to maximize rewards. Q-learning has been applied to various tasks such as foraging, soccer and prey-pursuing robots. In this paper, a simple foraging task has been considered to study the influences of the policies reported in the open literatures. A mobile robot is used to search and retrieve pucks back to a home location. The goal of this study is to identify an efficient policy for q-learning which maximizes the number of pucks collected and minimizes the number of collisions in the environment. Policies namely greedy, epsilon-greedy, Boltzmann distribution and random search are used to study their performances in the foraging task and the results are presented.
Keywords :
learning (artificial intelligence); mobile robots; random processes; search problems; Boltzmann distribution; Q-learning; epsilon-greedy policy; foraging task; machine learning; mobile robot; random search; Boltzmann distribution; Convergence; Machine learning; Mechanical engineering; Mobile robots; Scattering; Testing; exploration-exploitation; foraging; machine learning; mobile-robot; policy; q-learning; reinforcement learning;
Conference_Titel :
Nature & Biologically Inspired Computing, 2009. NaBIC 2009. World Congress on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5053-4
DOI :
10.1109/NABIC.2009.5393616