DocumentCode
3653581
Title
PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning
Author
Márcio Eduardo G. ;João Viana da Fonseca ;Francisco das Chagas de Souza
Author_Institution
Dept. of Electr. Eng., Fed. Univ. of Maranhao, Sã
fYear
2014
fDate
3/1/2014 12:00:00 AM
Firstpage
69
Lastpage
76
Abstract
In the context of discrete multivariable optimal linear quadratic regulator and reinforcement learning, this paper is attained to an investigation on PNLMS algorithm for online approximated solution of Hamilton-Jacobi-Bellman Equation. The PNLMS-based algorithms is designed to estimate the parameters of value function that is guided by Heuristic Dynamic Programming approach. The problem is characterized as approximation problem of cost value function that is computed without dynamic system model, but only by the measurements of states and action of interaction between the real world system and controller engine design. The proposed PNLMS-based algorithm is evaluated in a simplified mathematical model of the longitudinal dynamics of an aircraft that is a multivariable system. The performance of PNLMS-based algorithm is compared with those of RLS-based and LMS-based algorithms. These comparisons are performed in terms of convergence aspects and computational cost for online implementations of optimal design methodlogy.
Keywords
"Mathematical model","Equations","Heuristic algorithms","Algorithm design and analysis","Least squares approximations","Aircraft","Atmospheric modeling"
Publisher
ieee
Conference_Titel
Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on
Type
conf
DOI
10.1109/UKSim.2014.108
Filename
7046041
Link To Document