• DocumentCode
    1301146
  • Title

    Distributed learning of the global maximum in a two-player stochastic game with identical payoffs

  • Author

    Kumar, P. R Srikanta ; Young, Gia-kinh

  • Author_Institution
    Dept. of Electr., Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
  • Issue
    6
  • fYear
    1985
  • Firstpage
    743
  • Lastpage
    753
  • Abstract
    Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decisionmakers. The case of two decisionmakers is considered, and prior knowledge is assumed about the expected rewards. The asymmetries that may be present in the reward matrix is captured by the prior knowledge. It is shown that each decisionmaker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.
  • Keywords
    Accuracy; Convergence; Cybernetics; Games; Learning automata; Steady-state; Stochastic processes;
  • fLanguage
    English
  • Journal_Title
    Systems, Man and Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9472
  • Type

    jour

  • DOI
    10.1109/TSMC.1985.6313458
  • Filename
    6313458