DocumentCode :
643040
Title :
Stability of the Nash equilibrium under gradient ascent learning algorithms in two-agent two-action games
Author :
Bhaya, Amit ; Brandolt Sodre de Macedo, Rodrigo ; Shiguemitsu Shigueoka, Lucas
Author_Institution :
Dept. of Electr. Eng., Fed. Univ. of Rio de Janeiro, Rio de Janeiro, Brazil
fYear :
2013
fDate :
28-30 Aug. 2013
Firstpage :
837
Lastpage :
842
Abstract :
This paper provides a unified view and stability analysis of reinforcement learning algorithms for general sum games that have been proposed in the literature. Specifically, the gradient ascent learning algorithms proposed by Singh, Kearns and Mansour, and the variant proposed by Bowling and Veloso are shown to lead to convergence to the Nash equilibrium, using a switching control viewpoint and providing a unified Lyapunov function analysis. Furthermore, a proof of stability of the Nash equilibrium under the weighted policy learning (WPL) algorithm, which was proposed, without formal proof, by Abdallah and Lesser, is also arrived at using a Liapunov function approach and involves the novel feature of an analysis of the virtual equilibrium points. The importance of providing a stability proof for WPL dynamics is that the latter allows agents to reach a Nash equilibrium in two-agent, two-action games in which the only feedback that an agent needs is its own reward, and no agent uses knowledge of the rewards or actions of other agents, or any a priori information on the location of the Nash equilibrium.
Keywords :
game theory; gradient methods; learning (artificial intelligence); stability; Lyapunov function analysis; Nash equilibrium stability; WPL algorithm; general sum games; gradient ascent learning algorithms; reinforcement learning algorithms; stability analysis; switching control viewpoint; two-agent two-action games; virtual equilibrium points; weighted policy learning; Convergence; Games; Nash equilibrium; Stability analysis; Switches; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Applications (CCA), 2013 IEEE International Conference on
Conference_Location :
Hyderabad
ISSN :
1085-1992
Type :
conf
DOI :
10.1109/CCA.2013.6662854
Filename :
6662854
Link To Document :
بازگشت