DocumentCode
971240
Title
Incremental Value Iteration for Time-Aggregated Markov-Decision Processes
Author
Sun, Tao ; Zhao, Qianchuan ; Luh, Peter B.
Author_Institution
Tsinghua Univ., Beijing
Volume
52
Issue
11
fYear
2007
Firstpage
2177
Lastpage
2182
Abstract
A value iteration algorithm for time-aggregated Markov-decision processes (MDPs) is developed to solve problems with large state spaces. The algorithm is based on a novel approach which solves a time aggregated MDP by incrementally solving a set of standard MDPs. Therefore, the algorithm converges under the same assumption as standard value iteration. Such assumption is much weaker than that required by the existing time aggregated value iteration algorithm. The algorithms developed in this paper are also applicable to MDPs with fractional costs.
Keywords
Markov processes; decision theory; iterative methods; fractional costs; incremental value iteration; large state spaces; time-aggregated Markov-decision processes; Artificial intelligence; Automation; Costs; Equations; Intelligent systems; International collaboration; Operations research; Standards development; State-space methods; Sun; Fractional cost; Markov-decision processes (MDPs); policy iteration; time aggregation; value iteration;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.2007.908359
Filename
4380511
Link To Document