DocumentCode :
183507
Title :
Proving convergence of log-linear learning in potential games
Author :
Tatarenko, Tatiana
Author_Institution :
Dept. of Control Theor. & Robot., Tech. Univ. Darmstadt, Darmstadt, Germany
fYear :
2014
fDate :
4-6 June 2014
Firstpage :
972
Lastpage :
977
Abstract :
In this paper, we provide a theoretical analysis of log-linear learning algorithm that can be used for studying decision processes and solving control problems related to potential games. So far, for this algorithm the convergence of collective behavior to some state in a potential game has been proven to be specified by a chosen parameter and does not imply the convergence in probability to potential function maximizers. Tending the parameter and time to infinity simultaneously, we investigate the probabilistic convergence of joint actions based on the log-linear learning algorithm. We formulate conditions under which the convergence does not take place at all. Nevertheless, we explain how the parameter and utility functions should be designed to guarantee the probabilistic convergence of system behavior with some stationary distribution. Moreover, for such a setup in potential games the stable states, i.e. those having positive probability in the limit stationary distribution, are from the set of Nash equilibria and maximize the potential function.
Keywords :
convergence; game theory; learning (artificial intelligence); mathematics computing; Nash equilibria; collective behavior convergence; limit stationary distribution; log-linear learning algorithm; parameter functions; potential games; probabilistic convergence; stable states; stationary distribution; utility functions; Algorithm design and analysis; Convergence; Games; Heuristic algorithms; Joints; Markov processes; Nash equilibrium; Agents-based systems; Decentralized control; Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2014
Conference_Location :
Portland, OR
ISSN :
0743-1619
Print_ISBN :
978-1-4799-3272-6
Type :
conf
DOI :
10.1109/ACC.2014.6858606
Filename :
6858606
Link To Document :
بازگشت