DocumentCode
1408879
Title
Markov Decisions on a Partitioned State Space
Author
Smith, John L.
Author_Institution
Division of Computing Research, Commonwealth Scientific and Industrial Research Organization, Canberra, Australian Capital Territory, Australia.
Issue
1
fYear
1971
Firstpage
55
Lastpage
60
Abstract
An important practical constraint on admissible control policies is defined for the Markov decision process. The framework of an algorithm based on the infinite return optimization algorithms of Howard and Jewell is suggested to compute the optimal policy under this constraint. Iterative convergence to the optimal policy cannot be guaranteed, but techniques proposed for state-space reduction and rapid resolution of undetermined policies should render many problems tractable.
Keywords
Australia; Control systems; Convergence; Cost function; Iterative algorithms; Mathematical model; Optimal control; Partitioning algorithms; State-space methods; Stochastic systems
fLanguage
English
Journal_Title
Systems, Man and Cybernetics, IEEE Transactions on
Publisher
IEEE
ISSN
0018-9472
Type
jour
DOI
10.1109/TSMC.1971.5408604
Filename
5408604