• DocumentCode
    971240
  • Title

    Incremental Value Iteration for Time-Aggregated Markov-Decision Processes

  • Author

    Sun, Tao ; Zhao, Qianchuan ; Luh, Peter B.

  • Author_Institution
    Tsinghua Univ., Beijing
  • Volume
    52
  • Issue
    11
  • fYear
    2007
  • Firstpage
    2177
  • Lastpage
    2182
  • Abstract
    A value iteration algorithm for time-aggregated Markov-decision processes (MDPs) is developed to solve problems with large state spaces. The algorithm is based on a novel approach which solves a time aggregated MDP by incrementally solving a set of standard MDPs. Therefore, the algorithm converges under the same assumption as standard value iteration. Such assumption is much weaker than that required by the existing time aggregated value iteration algorithm. The algorithms developed in this paper are also applicable to MDPs with fractional costs.
  • Keywords
    Markov processes; decision theory; iterative methods; fractional costs; incremental value iteration; large state spaces; time-aggregated Markov-decision processes; Artificial intelligence; Automation; Costs; Equations; Intelligent systems; International collaboration; Operations research; Standards development; State-space methods; Sun; Fractional cost; Markov-decision processes (MDPs); policy iteration; time aggregation; value iteration;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2007.908359
  • Filename
    4380511