Title :
Aggregation-disaggregation algorithm for ε2-singularly perturbed limiting average Markov control problems
Author :
Abbad, Mohammed ; Filar, Jerzy A.
Author_Institution :
Dept. of Math. & Stat., Maryland Univ., Baltimore, MD, USA
Abstract :
Finite state and action Markov decision processes (MDPs) are dynamic, stochastic systems controlled by a controller. These models are usually referred to as Markovian control problems (MCPs). The authors consider a singular perturbation of order 2 for a Markov decision process with the limiting average reward criterion. They define a singular perturbation of order 2 in the following sense: it is assumed that the underlying process is composed of n separate irreducible processes, and that a small ε-perturbation is such that it unites these processes into m separate irreducible processes. Then another small ε2-perturbation is such that it unites these latter processes into a single irreducible process. The singular perturbation of order 2 is formulated. The limit MCP that is entirely different from the original unperturbed MDP, which forms an appropriate asymptotic approximation to a whole family of perturbed problems, is given explicitly. Thus, only the single limit MCP needs to be solved. An aggregation-disaggregation algorithm is constructed for solving the limit MCP
Keywords :
Markov processes; decision theory; perturbation techniques; stochastic systems; Markov control problems; Markov decision processes; aggregation-disaggregation algorithm; dynamic stochastic systems; irreducible processes; limiting average reward criterion; singular perturbation; Application software; Communication networks; Computer network management; Computer networks; Context modeling; Control systems; Mathematics; Performance analysis; Statistics; Stochastic systems;
Conference_Titel :
Decision and Control, 1991., Proceedings of the 30th IEEE Conference on
Conference_Location :
Brighton
Print_ISBN :
0-7803-0450-0
DOI :
10.1109/CDC.1991.261346