Title :
Decomposition and parallel processing techniques for two-time scale controlled Markov chains
Author :
Filar, J.A. ; Gondzio, J. ; Haurie, A. ; Moresino, F. ; Vial, J. Ph
Author_Institution :
Sch. of Math., Univ. of South Australia, Australia
Abstract :
Deals with a class of ergodic control problems for systems described by Markov chains with strong and weak interactions. These systems are composed of a set of m subchains that are weakly coupled. Using results established by Abbad et al. (1992). We formulate a limit control problem the solution of which can be obtained via an associated nondifferentiable convex programming (NDCP) problem. The technique used to solve the NDCP problem is the analytic center cutting plane method (ACCPM) which implements a dialogue between, on one hand, a master program computing the analytical center of a localization set containing the solution and, on the other hand, an oracle proposing cutting planes that reduce the size of the localization set at each main iteration. The interesting aspect of this implementation comes from two characteristics: (i) the oracle proposes cutting planes by solving reduced sized Markov decision problems (MDP) via a linear program (LP) or a policy iteration method; (ii) several cutting planes can be proposed simultaneously through a parallel implementation on m processors. The paper concentrates on these two aspects and shows, on a large scale MDP obtained from the numerical approximation “a la Kushner-Dupuis” of a singularly perturbed hybrid stochastic control problem, the important computational speed-up obtained
Keywords :
Markov processes; convex programming; decision theory; linear programming; parallel processing; singularly perturbed systems; stochastic systems; analytic center cutting plane method; computational speed-up; ergodic control problems; limit control problem; localization set; master program; nondifferentiable convex programming problem; policy iteration method; reduced sized Markov decision problems; singularly perturbed hybrid stochastic control problem; strong interactions; two-time scale controlled Markov chains; weak interactions; Books; Control systems; Costs; Large-scale systems; Linear programming; Mathematics; Optimal control; Parallel processing; Polynomials; Stochastic processes;
Conference_Titel :
Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7803-6638-7
DOI :
10.1109/CDC.2000.912851