• DocumentCode
    643040
  • Title

    Stability of the Nash equilibrium under gradient ascent learning algorithms in two-agent two-action games

  • Author

    Bhaya, Amit ; Brandolt Sodre de Macedo, Rodrigo ; Shiguemitsu Shigueoka, Lucas

  • Author_Institution
    Dept. of Electr. Eng., Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
  • fYear
    2013
  • fDate
    28-30 Aug. 2013
  • Firstpage
    837
  • Lastpage
    842
  • Abstract
    This paper provides a unified view and stability analysis of reinforcement learning algorithms for general sum games that have been proposed in the literature. Specifically, the gradient ascent learning algorithms proposed by Singh, Kearns and Mansour, and the variant proposed by Bowling and Veloso are shown to lead to convergence to the Nash equilibrium, using a switching control viewpoint and providing a unified Lyapunov function analysis. Furthermore, a proof of stability of the Nash equilibrium under the weighted policy learning (WPL) algorithm, which was proposed, without formal proof, by Abdallah and Lesser, is also arrived at using a Liapunov function approach and involves the novel feature of an analysis of the virtual equilibrium points. The importance of providing a stability proof for WPL dynamics is that the latter allows agents to reach a Nash equilibrium in two-agent, two-action games in which the only feedback that an agent needs is its own reward, and no agent uses knowledge of the rewards or actions of other agents, or any a priori information on the location of the Nash equilibrium.
  • Keywords
    game theory; gradient methods; learning (artificial intelligence); stability; Lyapunov function analysis; Nash equilibrium stability; WPL algorithm; general sum games; gradient ascent learning algorithms; reinforcement learning algorithms; stability analysis; switching control viewpoint; two-agent two-action games; virtual equilibrium points; weighted policy learning; Convergence; Games; Nash equilibrium; Stability analysis; Switches; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Applications (CCA), 2013 IEEE International Conference on
  • Conference_Location
    Hyderabad
  • ISSN
    1085-1992
  • Type

    conf

  • DOI
    10.1109/CCA.2013.6662854
  • Filename
    6662854