• DocumentCode
    3653581
  • Title

    PNLMS-based Algorithm for Online Approximated Solution of HJB Equation in the Context of Discrete MIMO Optimal Control and Reinforcement Learning

  • Author

    Márcio Eduardo G. ;João Viana da Fonseca ;Francisco das Chagas de Souza

  • Author_Institution
    Dept. of Electr. Eng., Fed. Univ. of Maranhao, Sã
  • fYear
    2014
  • fDate
    3/1/2014 12:00:00 AM
  • Firstpage
    69
  • Lastpage
    76
  • Abstract
    In the context of discrete multivariable optimal linear quadratic regulator and reinforcement learning, this paper is attained to an investigation on PNLMS algorithm for online approximated solution of Hamilton-Jacobi-Bellman Equation. The PNLMS-based algorithms is designed to estimate the parameters of value function that is guided by Heuristic Dynamic Programming approach. The problem is characterized as approximation problem of cost value function that is computed without dynamic system model, but only by the measurements of states and action of interaction between the real world system and controller engine design. The proposed PNLMS-based algorithm is evaluated in a simplified mathematical model of the longitudinal dynamics of an aircraft that is a multivariable system. The performance of PNLMS-based algorithm is compared with those of RLS-based and LMS-based algorithms. These comparisons are performed in terms of convergence aspects and computational cost for online implementations of optimal design methodlogy.
  • Keywords
    "Mathematical model","Equations","Heuristic algorithms","Algorithm design and analysis","Least squares approximations","Aircraft","Atmospheric modeling"
  • Publisher
    ieee
  • Conference_Titel
    Computer Modelling and Simulation (UKSim), 2014 UKSim-AMSS 16th International Conference on
  • Type

    conf

  • DOI
    10.1109/UKSim.2014.108
  • Filename
    7046041