• DocumentCode
    1369992
  • Title

    Aggregation of the policy iteration method for nearly completely decomposable Markov chains

  • Author

    Aldhaheri, Rabah W. ; Khalil, Hassan K.

  • Author_Institution
    Dept. of Electr. Eng., King Abdulaziz Univ., Jeddah, Saudi Arabia
  • Volume
    36
  • Issue
    2
  • fYear
    1991
  • fDate
    2/1/1991 12:00:00 AM
  • Firstpage
    178
  • Lastpage
    187
  • Abstract
    A steady-state optimal control problem is considered for nearly completely decomposable Markov chains. In order to apply the policy iteration method of R.A. Howard (Dynamic Programming and Markov Processes, Cambridge, MA, MIT Press, 1960), a high-dimensional ill-conditioned system of algebraic equations must be solved in the value-determination step. Although algorithms exist for aggregation of the steady-state probability distribution problem, they only provide methods for computing the cost, not the dual variables. Using a singular perturbation approach, an aggregation method for the value-determination equation is developed. The aggregation method is developed in three steps. First, a class of similarity transformations that transform the system into a singularly perturbed form is developed. Second, an aggregation method to compute the steady-state probability distribution is derived. Third, this aggregation method is applied to the value determination step of Howard´s method (1960)
  • Keywords
    Markov processes; aggregation; iterative methods; optimal control; probability; aggregation method; algebraic equations; ill-conditioned system; nearly completely decomposable Markov chains; policy iteration method; similarity transformations; singular perturbation approach; steady-state optimal control problem; steady-state probability distribution; value-determination equation; Application software; Books; Costs; Distributed computing; Equations; Iterative methods; Optimal control; Probability distribution; Steady-state; Transforms;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/9.67293
  • Filename
    67293