Incremental Value Iteration for Time-Aggregated Markov-Decision Processes

Author

Sun, Tao ; Zhao, Qianchuan ; Luh, Peter B.

Author_Institution

Tsinghua Univ., Beijing

Volume

Issue

fYear

2007

Firstpage

2177

Lastpage

2182

Abstract

A value iteration algorithm for time-aggregated Markov-decision processes (MDPs) is developed to solve problems with large state spaces. The algorithm is based on a novel approach which solves a time aggregated MDP by incrementally solving a set of standard MDPs. Therefore, the algorithm converges under the same assumption as standard value iteration. Such assumption is much weaker than that required by the existing time aggregated value iteration algorithm. The algorithms developed in this paper are also applicable to MDPs with fractional costs.

Keywords

Markov processes; decision theory; iterative methods; fractional costs; incremental value iteration; large state spaces; time-aggregated Markov-decision processes; Artificial intelligence; Automation; Costs; Equations; Intelligent systems; International collaboration; Operations research; Standards development; State-space methods; Sun; Fractional cost; Markov-decision processes (MDPs); policy iteration; time aggregation; value iteration;

fLanguage

English

Journal_Title

Automatic Control, IEEE Transactions on

Publisher

ieee

ISSN

0018-9286

Type

jour

DOI

10.1109/TAC.2007.908359

Filename

4380511

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=971240