• DocumentCode
    1408879
  • Title
    Markov Decisions on a Partitioned State Space
  • Author
    Smith, John L.
  • Author_Institution
    Division of Computing Research, Commonwealth Scientific and Industrial Research Organization, Canberra, Australian Capital Territory, Australia.
  • Issue
    1
  • fYear
    1971
  • Firstpage
    55
  • Lastpage
    60
  • Abstract
    An important practical constraint on admissible control policies is defined for the Markov decision process. The framework of an algorithm based on the infinite return optimization algorithms of Howard and Jewell is suggested to compute the optimal policy under this constraint. Iterative convergence to the optimal policy cannot be guaranteed, but techniques proposed for state-space reduction and rapid resolution of undetermined policies should render many problems tractable.
  • Keywords
    Australia; Control systems; Convergence; Cost function; Iterative algorithms; Mathematical model; Optimal control; Partitioning algorithms; State-space methods; Stochastic systems
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0018-9472
  • Type
    jour
  • DOI
    10.1109/TSMC.1971.5408604
  • Filename
    5408604