Title :
Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations
Author :
Costa, Oswaldo L V ; Aya, Julio C C
Author_Institution :
Dept. de Engenharia Eletronica, Escola Politecnica da Univ. de Sao Paulo, Brazil
Abstract :
We present an iterative technique for deriving the maximal solution of a set of discrete-time coupled algebraic Riccati equations (CARE), based on temporal difference methods. CARE are related to the optimal control of Markovian jump linear systems and have been extensively studied over the last few years. We trace a parallel with the theory of temporal difference algorithms for Markovian decision processes to develop a λ-policy iteration like algorithm for the maximal solution of these equations. For the special cases in which λ=0 and λ=1 we have the situation in which the algorithm reduces to the iterations of the Riccati difference equations (value iteration) and quasi-linearization method (policy iteration) respectively. The advantage of the proposed method is that an appropriate choice of λ between 0 and 1 can speed up the convergence of the policy evaluation step of the policy iteration method by using value iteration
Keywords :
Banach spaces; Markov processes; Riccati equations; convergence; decision theory; difference equations; discrete time systems; iterative methods; optimal control; λ-policy iteration like algorithm; Markovian decision processes; Riccati difference equations; discrete-time coupled algebraic Riccati equations; iterative technique; maximal solution; policy iteration; quasi-linearization method; temporal difference methods; value iteration; Convergence; Costs; Difference equations; Feedback control; Infinite horizon; Iterative methods; Linear systems; Optimal control; Riccati equations; Tin;
Conference_Titel :
American Control Conference, 1999. Proceedings of the 1999
Conference_Location :
San Diego, CA
Print_ISBN :
0-7803-4990-3
DOI :
10.1109/ACC.1999.786152