Title :
Reduction of temporal complexity in Markov decision processes
Author :
Garcia-Hernandez, M.G. ; Ruiz-Pinales, J. ; Ledesma-Orozco, S. ; Avina-Cervantes, G.
Author_Institution :
Univ. de Guanajuato, Salamanca, Mexico
Abstract :
In this paper we propose a new algorithm in order to reduce temporal complexity in Markov decision processes. Value iteration is a classical algorithm for solving them, but this algorithm and its variants are quite slow for discount factors close to one and their convergence properties depend to a great extent on a good state update order. It has been shown that improved topological value iteration presents a good convergence speed thanks to the use of an improved topological ordering. Nevertheless, its drawback is due to high memory requirements. So, our algorithm obtains the optimal state backup order with less memory requirements. Experimental results on stochastic shortest-path problems (highly cyclic) are presented. Our approach obtained a considerable reduction in temporal complexity with respect to other variants of value iteration. For instance, the experiments showed in one test a reduction of six times with respect to asynchronous value iteration.
Keywords :
Markov processes; back-up procedures; computational complexity; convergence of numerical methods; iterative methods; storage management; Markov decision process; asynchronous value iteration; convergence properties; memory requirements; optimal state backup order; stochastic shortest-path problems; temporal complexity; topological value iteration; Acceleration; Complexity theory; Convergence; Equations; Markov processes; Memory management; Dijkstra algorithm; Markov decision processes; priority queues;
Conference_Titel :
Electrical Communications and Computers (CONIELECOMP), 2012 22nd International Conference on
Conference_Location :
Cholula, Puebla
Print_ISBN :
978-1-4577-1326-2
DOI :
10.1109/CONIELECOMP.2012.6189930