DocumentCode :
2856757
Title :
Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics
Author :
Heydari, A. ; Balakrishnan, S.N.
Author_Institution :
Mech. & Aerosp. Eng. Dept., Missouri Univ. of Sci. & Technol., Rolla, MO, USA
fYear :
2011
fDate :
June 29 2011-July 1 2011
Firstpage :
3047
Lastpage :
3052
Abstract :
A single neural network based controller called the Finite-SNAC is developed in this study to synthesize finite-horizon optimal controllers for nonlinear control-affine systems. For satisfying the constraint on the input, a non-quadratic cost function is used. Inputs to the neural network are the current system states and the time-to-go and the network outputs are the costates which are used to compute the feedback control. Convergence of the reinforcement learning based training method to the optimal solution, the training error and the network weights are provided. The resulting controller is shown to solve the associated time-varying Hamilton-Jacobi-Bellman (HJB) equation and provide the fixed-final-time optimal solution. Performance of the new synthesis technique is demonstrated through an attitude control problem wherein a rigid spacecraft performs a finite time attitude maneuver subject to control bounds. The new formulation has a great potential for implementation since it consists of only one neural network with single set of weights and it provides comprehensive feedback solutions online though it is trained offline.
Keywords :
feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; time-varying systems; HJB equation; attitude control problem; feedback control; finite time attitude maneuver; finite-SNAC; finite-horizon input-constrained control; finite-horizon optimal controller; fixed-final-time optimal solution; neural network based controller; nonlinear control-affine system; nonlinear optimal control; nonquadratic cost function; reinforcement learning based training; single network adaptive critics; spacecraft; time-varying Hamilton-Jacobi-Bellman equation; Convergence; Cost function; Equations; Mathematical model; Optimal control; Satellites; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2011
Conference_Location :
San Francisco, CA
ISSN :
0743-1619
Print_ISBN :
978-1-4577-0080-4
Type :
conf
DOI :
10.1109/ACC.2011.5991378
Filename :
5991378
Link To Document :
بازگشت