DocumentCode :
183740
Title :
Fixed final-time near optimal regulation of nonlinear discrete-time systems in affine form using output feedback
Author :
Qiming Zhao ; Hao Xu ; Jagannathan, Sarangapani
Author_Institution :
Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear :
2014
fDate :
4-6 June 2014
Firstpage :
4643
Lastpage :
4648
Abstract :
In this paper, the fixed final-time near optimal output regulation of affine nonlinear discrete-time systems with unknown system dynamics is considered. First, a neural network (NN)-based observer is proposed to reconstruct both the system state vector and control coefficient matrix. Next, actor-critic structure is utilized to approximate the time-varying solution of the Hamilton-Jacobi-Bellman (HJB) equation or value function. To satisfy the terminal constraint, a new error term is defined and incorporated in the NN update law so that the terminal constraint error is also minimized over time. A NN with constant weights and time-dependent activation function is employed to approximate the time-varying value function which subsequently is utilized to generate the fixed final time near optimal control policy due to NN reconstruction errors. The proposed scheme functions in a forward-in-time manner without offline training phase. The effectiveness of the proposed method is verified via simulation.
Keywords :
approximation theory; discrete time systems; feedback; function approximation; matrix algebra; neurocontrollers; nonlinear control systems; observers; optimal control; partial differential equations; time-varying systems; vectors; Hamilton-Jacobi-Bellman equation; NN reconstruction errors; NN update law; actor-critic structure; affine form; affine nonlinear discrete-time systems; constant weights; control coefficient matrix; fixed final-time near optimal regulation; neural network-based observer; output feedback; system state vector; terminal constraint error minimization; time-dependent activation function; time-varying solution approximation; time-varying value function approximation; unknown system dynamics; value function; Approximation methods; Artificial neural networks; Equations; Observers; Optimal control; Tuning; Vectors; Hamilton-Jacobi-Bellman equation; finite-horizon; neural network; optimal regulation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2014
Conference_Location :
Portland, OR
ISSN :
0743-1619
Print_ISBN :
978-1-4799-3272-6
Type :
conf
DOI :
10.1109/ACC.2014.6858756
Filename :
6858756
Link To Document :
بازگشت