DocumentCode :
183659
Title :
Neural network-based finite-horizon approximately optimal control of uncertain affine nonlinear continuous-time systems
Author :
Hao Xu ; Qiming Zhao ; Dierks, Travis ; Jagannathan, Sarangapani
Author_Institution :
Dept. of Electr. & Comput. Eng., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear :
2014
fDate :
4-6 June 2014
Firstpage :
1243
Lastpage :
1248
Abstract :
This paper develops a novel neural network (NN) based finite-horizon approximate optimal control of nonlinear continuous-time systems in affine form when the system dynamics are complete unknown. First an online NN identifier is proposed to learn the dynamics of the nonlinear continuous-time system. Subsequently, a second NN is utilized to learn the time-varying solution, or referred to as value function, of the Hamilton-Jacobi-Bellman (HJB) equation in an online and forward in time manner. Then, by using the estimated time-varying value function from the second NN and control coefficient matrix from the NN identifier, an approximate optimal control input is computed. To handle time varying value function, a NN with constant weights and time-varying activation function is considered and a suitable NN update law is derived based on normalized gradient descent approach. Further, in order to satisfy terminal constraint and ensure stability within the fixed final time, two extra terms, one corresponding to terminal constraint, and the other to stabilize the nonlinear system are added to the novel update law of the second NN. No initial stabilizing control is required. A uniformly ultimately boundedness of the closed-loop system is verified by using standard Lyapunov theory.
Keywords :
Lyapunov methods; closed loop systems; continuous time systems; gradient methods; neurocontrollers; nonlinear control systems; optimal control; stability; transfer functions; uncertain systems; HJB equation; Hamilton-Jacobi-Bellman equation; closed-loop system; control coefficient matrix; neural network-based finite-horizon approximately optimal control; normalized gradient descent approach; online NN identifier; stabilization; standard Lyapunov theory; system dynamics; terminal constraint; time-varying activation function; time-varying value function; uncertain affine nonlinear continuous-time systems; Artificial neural networks; Equations; Function approximation; Nonlinear dynamical systems; Optimal control; Hamilton-Jacobi-Bellman equation; approximate optimal control; finite-horizon; neural network;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2014
Conference_Location :
Portland, OR
ISSN :
0743-1619
Print_ISBN :
978-1-4799-3272-6
Type :
conf
DOI :
10.1109/ACC.2014.6858693
Filename :
6858693
Link To Document :
بازگشت