Title :
Policy iteration algorithm for distributed networks and graphical games
Author :
Vamvoudakis, Kyriakos G. ; Lewis, F.L.
Author_Institution :
Autom. & Robot. Res. Inst., Univ. of Texas at Arlington, Fort Worth, TX, USA
Abstract :
This paper brings together cooperative control, reinforcement learning, and game theory to present a multi-agent distributed formulation for graphical games. The notion of graphical games is developed for dynamical systems, where the dynamics and performance indices for each node depend only on local neighbor information. We propose a cooperative policy iteration algorithm for graphical games. This algorithm converges to the best response when the neighbors of each agent do not update their policies and to the Nash equilibrium when all agents update their policies simultaneously. It is also shown that the convergence of this algorithm is based on the speed of convergence of the neighbors of every player in the graph, graph topology, and user defined matrices in the performance index. This framework will be used to develop methods for online adaptive learning solutions of graphical games in real time.
Keywords :
computer games; computer graphics; game theory; graph theory; iterative methods; learning (artificial intelligence); matrix algebra; multi-agent systems; Nash equilibrium; cooperative control; cooperative policy iteration algorithm; distributed network; dynamical systems; game theory; graph topology; graphical game; local neighbor information; multiagent distributed formulation; online adaptive learning solution; reinforcement learning; user defined matrix; Equations; Games; Heuristic algorithms; Nash equilibrium; Performance analysis; Synchronization; Vectors; Nash-equilibrium; Policy Iteration; best response; cooperative Hamilton-Jacobi equations; graphical games;
Conference_Titel :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-800-6
Electronic_ISBN :
0743-1546
DOI :
10.1109/CDC.2011.6160491