Markov Decisions on a Partitioned State Space

Author

Smith, John L.

Author_Institution

Division of Computing Research, Commonwealth Scientific and Industrial Research Organization, Canberra, Australian Capital Territory, Australia

Issue

Year

1971

Firstpage

Lastpage

Abstract

An important practical constraint on admissible control policies is defined for the Markov decision process. The framework of an algorithm based on the infinite return optimization algorithms of Howard and Jewell is suggested to compute the optimal policy under this constraint. Iterative convergence to the optimal policy cannot be guaranteed, but techniques proposed for state-space reduction and rapid resolution of undetermined policies should render many problems tractable.

Keywords

Australia; Control systems; Convergence; Cost function; Iterative algorithms; Mathematical model; Optimal control; Partitioning algorithms; State-space methods; Stochastic systems

Language

English

Journal_Title

IEEE Transactions on Systems, Man, and Cybernetics

Publisher

IEEE

ISSN

0018-9472

Type

Journal article

DOI

10.1109/TSMC.1971.5408604

Filename

5408604

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1408879