DocumentCode
1301146
Title
Distributed learning of the global maximum in a two-player stochastic game with identical payoffs
Author
Kumar, P. R Srikanta ; Young, Gia-kinh
Author_Institution
Dept. of Electr., Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
Issue
6
fYear
1985
Firstpage
743
Lastpage
753
Abstract
Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decisionmakers. The case of two decisionmakers is considered, and prior knowledge is assumed about the expected rewards. The asymmetries that may be present in the reward matrix is captured by the prior knowledge. It is shown that each decisionmaker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.
Keywords
Accuracy; Convergence; Cybernetics; Games; Learning automata; Steady-state; Stochastic processes;
fLanguage
English
Journal_Title
Systems, Man and Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
0018-9472
Type
jour
DOI
10.1109/TSMC.1985.6313458
Filename
6313458
Link To Document