Title :
Intermittently Proving Dynamic Programming to Solve Infinite MDPs on GPUs
Author :
Inamoto, Tsutomu ; Higami, Yoshinobu ; Kobayashi, Shin-ya
Author_Institution :
Grad. Sch. of Sci. & Eng., Ehime Univ., Matsuyama, Japan
Abstract :
In this paper, we propose a variant of dynamic programming suited to solving infinite Markov decision processes on GPUs. The primary feature of the proposed method is that values are transferred and checked to prove the convergence of the procedure only intermittently, not always. The proposed method is expected to reduce computation time by suppressing surplus transfers and checks of values. This expectation is verified by applying several dynamic programming programs to a simple animat problem and the mountain-car problem.
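The idea of intermittent checking described in the abstract can be illustrated with a minimal sketch. This is not the authors' GPU implementation: it is a CPU-only, pure-Python value iteration on a hypothetical two-state deterministic MDP, and the names `value_iteration`, `check_interval`, `transitions`, and `rewards` are assumptions introduced here for illustration. On a GPU, the residual computation inside the `if` branch would stand for the device-to-host transfer and check that the method aims to perform less often.

```python
def value_iteration(transitions, rewards, gamma=0.5, tol=1e-9,
                    check_interval=10, max_sweeps=10_000):
    """Synchronous value iteration with an intermittent convergence check.

    transitions[s][a] is the (deterministic) next state for action a in
    state s, and rewards[s][a] is the immediate reward. Both structures
    are hypothetical stand-ins, not taken from the paper.
    Returns (values, sweeps_done, checks_done).
    """
    n_states = len(transitions)
    values = [0.0] * n_states
    checks = 0
    for sweep in range(1, max_sweeps + 1):
        previous = values
        # One full Bellman backup over all states, reading only `previous`.
        values = [
            max(rewards[s][a] + gamma * previous[transitions[s][a]]
                for a in range(len(transitions[s])))
            for s in range(n_states)
        ]
        # Check convergence only every `check_interval` sweeps.
        if sweep % check_interval == 0:
            checks += 1
            residual = max(abs(v - p) for v, p in zip(values, previous))
            if residual < tol:
                return values, sweep, checks
    return values, max_sweeps, checks


# Hypothetical toy MDP: state 0 can stay (action 0) or move to state 1
# (action 1), both with reward 0; state 1 stays with reward 1.
transitions = [[0, 1], [1]]
rewards = [[0.0, 0.0], [1.0]]

every = value_iteration(transitions, rewards, check_interval=1)
sparse = value_iteration(transitions, rewards, check_interval=10)
print("check every sweep:  sweeps =", every[1], "checks =", every[2])
print("check every 10th:   sweeps =", sparse[1], "checks =", sparse[2])
```

In this sketch, a larger `check_interval` may cost a few extra sweeps past the point of convergence, in exchange for far fewer checks. Whether that trade pays off on real hardware depends on the relative cost of a sweep and a transfer, which this CPU example does not measure.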
Keywords :
Markov processes; decision making; digital arithmetic; dynamic programming; graphics processing units; GPU; animat problem; computational times; infinite MDP; infinite Markov decision processes; mountain-car problem; surplus transfers; Approximation methods; Equations; Instruction sets; TV; GPGPU
Conference_Title :
Computing and Networking (CANDAR), 2013 First International Symposium on
Conference_Location :
Matsuyama
Print_ISBN :
978-1-4799-2795-1
DOI :
10.1109/CANDAR.2013.44