Title of article
The Convergence of Value Iteration in Discounted Markov Decision Processes
Author/Authors
D.J. White، نويسنده , , W.T. Scherer، نويسنده ,
Issue Information
دوهفته نامه با شماره پیاپی سال 1994
Pages
13
From page
348
To page
360
Abstract
Considerable numerical experience indicates that the standard value iteration procedure for infinite horizon discounted Markov decision processes performs much better than the usual error bound analysis suggests. This paper attempts to examine why this happens and introduces an additional pointwise convergence concept to that of the usual maximum norm concept, in order to examine why some states exhibit better convergence behaviour than others. We also present some numerical results.
Journal title
Journal of Mathematical Analysis and Applications
Serial Year
1994
Journal title
Journal of Mathematical Analysis and Applications
Record number
938056
Link To Document