Title :
Payoff-based Inhomogeneous Partially Irrational Play for potential game theoretic cooperative control: Convergence analysis
Author :
Tatsuhiko Goto;Takeshi Hatanaka;Masayuki Fujita
Author_Institution :
Toshiba Corporation
fDate :
6/1/2012 12:00:00 AM
Abstract :
This paper investigates learning algorithm design in potential game theoretic cooperative control, where it is in general required for agents´ collective action to converge to the most efficient equilibria while standard game theory aims at just computing a Nash equilibrium. In particular, the equilibria maximizing the potential function should be selected in case the utility functions are already aligned to a global objective function. In order to meet the requirement, this paper develops a learning algorithm called Payoff-based Inhomogeneous Partially Irrational Play (PIPIP). The main feature of PIPIP is to allow agents to make irrational decisions with a specified probability, i.e. agents can choose an action with a low utility from the past actions stored in the memory. We then prove convergence in probability of the collective action to the potential function maximizers. Finally, the effectiveness of the present algorithm is demonstrated through simulation on a sensor coverage problem.
Keywords :
"Resistance","Games","Markov processes","Equations","Algorithm design and analysis","Nash equilibrium","Nonhomogeneous media"
Conference_Titel :
American Control Conference (ACC), 2012
Print_ISBN :
978-1-4577-1095-7