DocumentCode :
997685
Title :
Two-Time Scale Controlled Markov Chains: A Decomposition and Parallel Processing Approach
Author :
Haurie, Alain ; Moresino, Francesco
Author_Institution :
Univ. de Geneve, Chene-Bougeries
Volume :
52
Issue :
12
fYear :
2007
Firstpage :
2325
Lastpage :
2331
Abstract :
This correspondence deals with a class of ergodic control problems for systems described by Markov chains with strong and weak interactions. These systems are composed of a set of subchains that are weakly coupled. Using results already available in the literature one formulates a limit control problem the solution of which can be obtained via an associated nondifferentiable convex programming (NDCP) problem. The technique used to solve the NDCP problem is the Analytic Center Cutting Plane Method (ACCPM) which implements a dialogue between, on one hand, a master program computing the analytical center of a localization set containing the solution and, on the other hand, an oracle proposing cutting planes that reduce the size of the localization set at each main iteration. The interesting aspect of this implementation comes from two characteristics: (i) the oracle proposes cutting planes by solving reduced sized Markov Decision Problems (MDP) via a linear program (LP) or a policy iteration method; (ii) several cutting planes can be proposed simultaneously through a parallel implementation on processors. The correspondence concentrates on these two aspects and shows, on a large scale MDP obtained from the numerical approximation ldquoa la Kushner-Dupuisrdquo of a singularly perturbed hybrid stochastic control problem, the important computational speed-up obtained.
Keywords :
Markov processes; approximation theory; convex programming; iterative methods; linear programming; optimal control; parallel processing; stochastic systems; Markov decision problems; analytic center cutting plane method; cutting planes; decomposition approach; ergodic control problems; hybrid stochastic control problem; linear program; master program computing; nondifferentiable convex programming; parallel processing approach; policy iteration method; two-time scale controlled Markov chains; Books; Computational complexity; Control systems; Costs; Large-scale systems; Linear programming; Markov processes; Optimal control; Parallel processing; Stochastic processes; Decomposition technique; interior point method; singular perturbations; two-timescale Markov chain;
fLanguage :
English
Journal_Title :
Automatic Control, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9286
Type :
jour
DOI :
10.1109/TAC.2007.910692
Filename :
4395177
Link To Document :
بازگشت