DocumentCode
678672
Title
Intermittently Proving Dynamic Programming to Solve Infinite MDPs on GPUs
Author
Inamoto, Tsutomu ; Higami, Yoshinobu ; Kobayashi, Shin-ya
Author_Institution
Grad. Sch. of Sci. & Eng., Ehime Univ., Matsuyama, Japan
fYear
2013
fDate
4-6 Dec. 2013
Firstpage
252
Lastpage
256
Abstract
In this paper, we propose a variant of the dynamic programming which is suitable for solving infinite Markov decision processes on GPUs. The primary feature of the proposed method is to not always but intermittently transfer and check values for proving the convergence of the procedure. It is expected for the proposed method to decrease computational times by suppressing surplus transfers and checks of values. This expectation is verified through applications of some dynamic programming programs to a simple animat problem and the mountain-car problem.
Keywords
Markov processes; decision making; digital arithmetic; dynamic programming; graphics processing units; GPU; animat problem; computational times; dynamic programming; infinite MDP; infinite Markov decision processes; mountain-car problem; surplus transfers; Approximation methods; Dynamic programming; Equations; Graphics processing units; Instruction sets; Markov processes; TV; GPGPU; dynamic programming;
fLanguage
English
Publisher
ieee
Conference_Titel
Computing and Networking (CANDAR), 2013 First International Symposium on
Conference_Location
Matsuyama
Print_ISBN
978-1-4799-2795-1
Type
conf
DOI
10.1109/CANDAR.2013.44
Filename
6726906
Link To Document