• DocumentCode
    3526683
  • Title

    Solving multichain stochastic games with mean payoff by policy iteration

  • Author

    Akian, Marianne ; Cochet-Terrasson, Jean ; Detournay, Sylvie ; Gaubert, Stephane

  • Author_Institution
    CMAP, EINRIA Saclay-Ile-de-France, Palaiseau, France
  • fYear
    2013
  • fDate
    10-13 Dec. 2013
  • Firstpage
    1834
  • Lastpage
    1841
  • Abstract
    Zero-sum stochastic games with finite state and action spaces, perfect information, and mean payoff criteria arise in particular from the monotone discretization of mean-payoff pursuit-evasion deterministic differential games. In that case no irreducibility assumption on the Markov chains associated to strategies are satisfied (multichain games). The value of such a game can be characterized by a system of nonlinear equations, involving the mean payoff vector and an auxiliary vector (relative value or bias). Cochet-Terrasson and Gaubert proposed in (C. R. Math. Acad. Sci. Paris, 2006) a policy iteration algorithm relying on a notion of nonlinear spectral projection (Akian and Gaubert, Nonlinear Analysis TMA, 2003), which allows one to avoid cycling in degenerate iterations. We give here a complete presentation of the algorithm, with details of implementation in particular of the nonlinear projection. This has led to the software PIGAMES and allowed us to present numerical results on pursuit-evasion games.
  • Keywords
    Markov processes; iterative methods; nonlinear equations; stochastic games; vectors; Markov chains; PIGAMES software; action spaces; auxiliary vector; finite state; mean payoff criteria; mean payoff vector; mean-payoff pursuit-evasion deterministic differential games; monotone discretization; multichain stochastic games; nonlinear projection; nonlinear spectral projection; perfect information; policy iteration algorithm; system-of-nonlinear equations; zero-sum stochastic games; Bismuth; Dynamic programming; Equations; Games; Harmonic analysis; Software algorithms; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2013 IEEE 52nd Annual Conference on
  • Conference_Location
    Firenze
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4673-5714-2
  • Type

    conf

  • DOI
    10.1109/CDC.2013.6760149
  • Filename
    6760149