Title of article

The Convergence of Value Iteration in Discounted Markov Decision Processes

Author/Authors

D.J. White، نويسنده , , W.T. Scherer، نويسنده ,

Issue Information

دوهفته نامه با شماره پیاپی سال 1994

Pages

From page

348

To page

360

Abstract

Considerable numerical experience indicates that the standard value iteration procedure for infinite horizon discounted Markov decision processes performs much better than the usual error bound analysis suggests. This paper attempts to examine why this happens and introduces an additional pointwise convergence concept to that of the usual maximum norm concept, in order to examine why some states exhibit better convergence behaviour than others. We also present some numerical results.

Journal title

Journal of Mathematical Analysis and Applications

Serial Year

1994

Journal title

Journal of Mathematical Analysis and Applications

Record number

938056

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=10&DC=938056