DocumentCode
643040
Title
Stability of the Nash equilibrium under gradient ascent learning algorithms in two-agent two-action games
Author
Bhaya, Amit ; Brandolt Sodre de Macedo, Rodrigo ; Shiguemitsu Shigueoka, Lucas
Author_Institution
Dept. of Electr. Eng., Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
fYear
2013
fDate
28-30 Aug. 2013
Firstpage
837
Lastpage
842
Abstract
This paper provides a unified view and stability analysis of reinforcement learning algorithms for general sum games that have been proposed in the literature. Specifically, the gradient ascent learning algorithms proposed by Singh, Kearns and Mansour, and the variant proposed by Bowling and Veloso are shown to lead to convergence to the Nash equilibrium, using a switching control viewpoint and providing a unified Lyapunov function analysis. Furthermore, a proof of stability of the Nash equilibrium under the weighted policy learning (WPL) algorithm, which was proposed, without formal proof, by Abdallah and Lesser, is also arrived at using a Liapunov function approach and involves the novel feature of an analysis of the virtual equilibrium points. The importance of providing a stability proof for WPL dynamics is that the latter allows agents to reach a Nash equilibrium in two-agent, two-action games in which the only feedback that an agent needs is its own reward, and no agent uses knowledge of the rewards or actions of other agents, or any a priori information on the location of the Nash equilibrium.
Keywords
game theory; gradient methods; learning (artificial intelligence); stability; Lyapunov function analysis; Nash equilibrium stability; WPL algorithm; general sum games; gradient ascent learning algorithms; reinforcement learning algorithms; stability analysis; switching control viewpoint; two-agent two-action games; virtual equilibrium points; weighted policy learning; Convergence; Games; Nash equilibrium; Stability analysis; Switches; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Applications (CCA), 2013 IEEE International Conference on
Conference_Location
Hyderabad
ISSN
1085-1992
Type
conf
DOI
10.1109/CCA.2013.6662854
Filename
6662854
Link To Document