مرکز منطقه ای اطلاع رساني علوم و فناوري - Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics

DocumentCode :

2856757

Title :

Finite-horizon input-constrained nonlinear optimal control using single network adaptive critics

Author :

Heydari, A. ; Balakrishnan, S.N.

Author_Institution :

Mech. & Aerosp. Eng. Dept., Missouri Univ. of Sci. & Technol., Rolla, MO, USA

fYear :

2011

fDate :

June 29 2011-July 1 2011

Firstpage :

3047

Lastpage :

3052

Abstract :

A single neural network based controller called the Finite-SNAC is developed in this study to synthesize finite-horizon optimal controllers for nonlinear control-affine systems. For satisfying the constraint on the input, a non-quadratic cost function is used. Inputs to the neural network are the current system states and the time-to-go and the network outputs are the costates which are used to compute the feedback control. Convergence of the reinforcement learning based training method to the optimal solution, the training error and the network weights are provided. The resulting controller is shown to solve the associated time-varying Hamilton-Jacobi-Bellman (HJB) equation and provide the fixed-final-time optimal solution. Performance of the new synthesis technique is demonstrated through an attitude control problem wherein a rigid spacecraft performs a finite time attitude maneuver subject to control bounds. The new formulation has a great potential for implementation since it consists of only one neural network with single set of weights and it provides comprehensive feedback solutions online though it is trained offline.

Keywords :

feedback; learning (artificial intelligence); neurocontrollers; nonlinear control systems; optimal control; time-varying systems; HJB equation; attitude control problem; feedback control; finite time attitude maneuver; finite-SNAC; finite-horizon input-constrained control; finite-horizon optimal controller; fixed-final-time optimal solution; neural network based controller; nonlinear control-affine system; nonlinear optimal control; nonquadratic cost function; reinforcement learning based training; single network adaptive critics; spacecraft; time-varying Hamilton-Jacobi-Bellman equation; Convergence; Cost function; Equations; Mathematical model; Optimal control; Satellites; Training;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

American Control Conference (ACC), 2011

Conference_Location :

San Francisco, CA

ISSN :

0743-1619

Print_ISBN :

978-1-4577-0080-4

Type :

conf

DOI :

10.1109/ACC.2011.5991378

Filename :

5991378

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2856757