Title :
Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems
Author :
Costa, Oswaldo L V ; Aya, Julio C C
Author_Institution :
Dept. de Engenharia de Telecomunicacoes e Controle, Sao Paulo Univ., Brazil
Abstract :
In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite horizon linear regulator problem of discrete-time Markovian jump linear systems for the case in which the transition probability matrix of the Markov chain is not known. It is well known that the optimal control of this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We trace a parallel with the theory of TD(λ) algorithms for Markovian decision processes to develop a TD(λ) like algorithm for the optimal control associated to the maximal solution of the CARE. Some numerical examples are also presented
Keywords :
Markov processes; Monte Carlo methods; Riccati equations; discrete time systems; iterative methods; linear systems; optimal control; stochastic systems; CARE; Markov chain; Monte Carlo TD(λ)-methods; Monte Carlo simulations; coupled algebraic Riccati equations; discrete-time Markovian jump linear systems; infinite horizon linear regulator problem; iterative technique; optimal control; transition probability matrix; Convergence; Councils; Feedback control; Hilbert space; Monte Carlo methods; Optimal control; Riccati equations;
Conference_Titel :
Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7803-6638-7
DOI :
10.1109/CDC.2000.912015