Implementation of Q-Learning algorithm for solving maze problem

Author

D. Osmanković; S. Konjicija

Author_Institution

Department of Automatic Control and Electronics, University in Sarajevo, Faculty of Electrical Engineering in Sarajevo, Zmaja od Bosne bb, 71000 Sarajevo, Bosnia and Herzegovina

fYear

2011

fDate

5/1/2011 12:00:00 AM

Firstpage

1619

Lastpage

1622

Abstract

Machine learning plays an important role in fields ranging from control systems to data mining. This paper presents a Q-learning implementation for abstract graph models, with maze solving (finding a trajectory out of the maze) taken as an example graph problem. The paper covers the conversion of a maze matrix into a Q-learning reward matrix and the implementation of the Q-learning algorithm on that reward matrix (analogous to the criterion matrix minimized in dynamic programming). The implementation sits at a higher level of abstraction, so other representations (artificial neural networks, trees, etc.) can be used. To test the Q-learning algorithm, the maze-solving problem was visualized in MATLAB, with the found trajectory marked on the maze. The maze is defined with the starting position in the top-left corner and the exit in the bottom-right corner. The performance of the algorithm is measured for different problem sizes.
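
The two steps named in the abstract (maze matrix to reward matrix, then Q-learning on that matrix) can be illustrated with a minimal Python sketch. This is not the authors' MATLAB code. The maze encoding (0 = free cell, 1 = wall), the reward values (100 for a move into the exit, 0 for any other allowed move), the discount factor of 0.8, the deterministic update Q(s, t) = R(s, t) + gamma * max Q(t, ·), and all function names are assumptions chosen for illustration. Only the start in the top-left corner and the exit in the bottom-right corner come from the abstract.

```python
import random


def maze_to_reward(maze, goal_reward=100.0):
    """Build a state-by-state reward matrix from a 0/1 grid (1 = wall).

    Cell (r, c) is state r * cols + c. R[s][t] is None when the move
    s -> t is not allowed (assumed encoding, not taken from the paper).
    """
    rows, cols = len(maze), len(maze[0])
    n = rows * cols
    goal = n - 1  # exit in the bottom-right corner
    R = [[None] * n for _ in range(n)]
    for r in range(rows):
        for c in range(cols):
            if maze[r][c] == 1:
                continue  # walls have no outgoing moves
            s = r * cols + c
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols and maze[nr][nc] == 0:
                    t = nr * cols + nc
                    R[s][t] = goal_reward if t == goal else 0.0
    return R, goal


def q_learning(R, goal, episodes=500, gamma=0.8, max_steps=10_000, seed=0):
    """Tabular Q-learning with random exploration and learning rate 1."""
    rng = random.Random(seed)
    n = len(R)
    Q = [[0.0] * n for _ in range(n)]
    # Episodes start in any non-goal state that has at least one allowed move.
    starts = [s for s in range(n)
              if s != goal and any(x is not None for x in R[s])]
    for _ in range(episodes):
        s = rng.choice(starts)
        for _ in range(max_steps):  # cap guards against unreachable exits
            if s == goal:
                break
            moves = [t for t in range(n) if R[s][t] is not None]
            t = rng.choice(moves)
            Q[s][t] = R[s][t] + gamma * max(Q[t])
            s = t
    return Q


def greedy_path(Q, R, start, goal):
    """Follow the highest-valued allowed move from start until the exit."""
    path, s = [start], start
    while s != goal and len(path) <= len(Q):
        moves = [t for t in range(len(Q)) if R[s][t] is not None]
        s = max(moves, key=lambda t: Q[s][t])
        path.append(s)
    return path


if __name__ == "__main__":
    # Hypothetical 4x4 maze; every free cell can reach the exit.
    maze = [
        [0, 0, 1, 0],
        [1, 0, 1, 0],
        [0, 0, 0, 0],
        [0, 1, 1, 0],
    ]
    R, goal = maze_to_reward(maze)
    Q = q_learning(R, goal)
    print(greedy_path(Q, R, start=0, goal=goal))
```

The reward matrix here is indexed by (state, next state), so the same learning loop would work for any graph expressed that way, which matches the abstract's framing of the maze as one instance of a graph problem. The sketch assumes the start cell is free and connected to the exit.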

Publisher

IEEE

Conference_Titel

MIPRO, 2011 Proceedings of the 34th International Convention

Print_ISBN

978-1-4577-0996-8

Type

conf

Filename

5967320

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3642752