مرکز منطقه ای اطلاع رساني علوم و فناوري - A novel generalized value iteration scheme for uncertain continuous-time linear systems

DocumentCode :

2581922

Title :

A novel generalized value iteration scheme for uncertain continuous-time linear systems

Author :

Lee, Jae Young ; Park, Jin Bae ; Choi, Yoon Ho

Author_Institution :

Dept. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea

fYear :

2010

fDate :

15-17 Dec. 2010

Firstpage :

4637

Lastpage :

4642

Abstract :

In this paper, a novel generalized value iteration (VI) technique is presented which is a reinforcement learning (RL) scheme for solving online the continuous-time (CT) discounted linear quadratic regulation (LQR) problems without exactly knowing the system matrix A. In the proposed method, a discounted value function is considered, which is a general setting in RL frameworks, but not fully considered in RL for CT dynamical systems. Moreover, a stepwise-varying learning rate is introduced for the fast and safe convergence. In relation to this learning rate, we also discuss the locations of the poles of the closed-loop system and monotone convergence to the optimal solution. The results from these discussions give the conditions on the stability and monotone convergence of the existing VI methods.

Keywords :

closed loop systems; continuous time systems; iterative methods; learning (artificial intelligence); linear quadratic control; linear systems; matrix algebra; uncertain systems; closed loop system; continuous-time discounted linear quadratic regulation problem; discounted value function; generalized value iteration scheme; matrix algebra; monotone convergence; reinforcement learning; uncertain continuous-time linear system; Convergence; DC motors; Equations; Heuristic algorithms; Least squares approximation; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control (CDC), 2010 49th IEEE Conference on

Conference_Location :

Atlanta, GA

ISSN :

0743-1546

Print_ISBN :

978-1-4244-7745-6

Type :

conf

DOI :

10.1109/CDC.2010.5718015

Filename :

5718015

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2581922