• DocumentCode
    189354
  • Title

    Differential graphical games: Policy iteration solutions and coupled Riccati formulation

  • Author

    Abouheaf, Mohammed I. ; Lewis, Frank L. ; Mahmoud, Magdi S.

  • Author_Institution
    Syst. Eng. Dept., King Fahd Univ. of Pet. & Miner., Dhahran, Saudi Arabia
  • fYear
    2014
  • fDate
    24-27 June 2014
  • Firstpage
    1594
  • Lastpage
    1599
  • Abstract
    This paper introduces novel Integral Reinforcement Learning solution to a class of differential games known as differential graphical games. The agents´ error dynamics are coupled dynamical systems driven by the control input of each agent and the control inputs of its neighbors. A new class of control policies is developed to solve the differential graphical games with innovative performance index which is used to measure the system performance. The graphical game Integral Reinforcement Learning Bellman equations are shown to be equivalent to certain graphical game coupled Hamilton-Jacobi-Bellman equations developed herein. Online Policy Iteration algorithm is proposed to solve the differential graphical game in real-time. Convergence of the policy iteration algorithm is shown under mild assumptions about the inter-connectivity properties of the graph. Novel coupled Riccati formulation is developed to solve the differential graphical games.
  • Keywords
    Riccati equations; computer games; graph theory; iterative methods; learning (artificial intelligence); Hamilton-Jacobi-Bellman equation; agents error dynamics; coupled Riccati formulation; coupled dynamical system; differential graphical games; innovative performance index; integral reinforcement learning solution; online policy iteration algorithm; policy iteration solution; Equations; Games; Nash equilibrium; Optimal control; Performance analysis; Synchronization; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (ECC), 2014 European
  • Conference_Location
    Strasbourg
  • Print_ISBN
    978-3-9524269-1-3
  • Type

    conf

  • DOI
    10.1109/ECC.2014.6862473
  • Filename
    6862473