DocumentCode :
1743477
Title :
Monte Carlo TD(λ)-methods for the optimal control of discrete-time Markovian jump linear systems
Author :
Costa, Oswaldo L V ; Aya, Julio C C
Author_Institution :
Dept. de Engenharia de Telecomunicacoes e Controle, Sao Paulo Univ., Brazil
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
1183
Abstract :
In this paper we present an iterative technique based on Monte Carlo simulations for deriving the optimal control of the infinite horizon linear regulator problem of discrete-time Markovian jump linear systems for the case in which the transition probability matrix of the Markov chain is not known. It is well known that the optimal control of this problem is given in terms of the maximal solution of a set of coupled algebraic Riccati equations (CARE), which have been extensively studied over the last few years. We trace a parallel with the theory of TD(λ) algorithms for Markovian decision processes to develop a TD(λ) like algorithm for the optimal control associated to the maximal solution of the CARE. Some numerical examples are also presented
Keywords :
Markov processes; Monte Carlo methods; Riccati equations; discrete time systems; iterative methods; linear systems; optimal control; stochastic systems; CARE; Markov chain; Monte Carlo TD(λ)-methods; Monte Carlo simulations; coupled algebraic Riccati equations; discrete-time Markovian jump linear systems; infinite horizon linear regulator problem; iterative technique; optimal control; transition probability matrix; Convergence; Councils; Feedback control; Hilbert space; Monte Carlo methods; Optimal control; Riccati equations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 2000. Proceedings of the 39th IEEE Conference on
Conference_Location :
Sydney, NSW
ISSN :
0191-2216
Print_ISBN :
0-7803-6638-7
Type :
conf
DOI :
10.1109/CDC.2000.912015
Filename :
912015
Link To Document :
بازگشت