Distributed learning of the global maximum in a two-player stochastic game with identical payoffs

Author

Kumar, P. R Srikanta ; Young, Gia-kinh

Author_Institution

Dept. of Electr., Comput. & Syst. Eng., Rensselaer Polytech. Inst., Troy, NY, USA

Issue

fYear

1985

Firstpage

743

Lastpage

753

Abstract

Little is known about the distributed learning of the global maximum in a stochastic framework when there is no communication between the decisionmakers. The case of two decisionmakers is considered, and prior knowledge is assumed about the expected rewards. The asymmetries that may be present in the reward matrix is captured by the prior knowledge. It is shown that each decisionmaker completely unaware of the other converges to the global optimum with arbitrary accuracy over time.

Keywords

Accuracy; Convergence; Cybernetics; Games; Learning automata; Steady-state; Stochastic processes;

fLanguage

English

Journal_Title

Systems, Man and Cybernetics, IEEE Transactions on

Publisher

ieee

ISSN

0018-9472

Type

jour

DOI

10.1109/TSMC.1985.6313458

Filename

6313458

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=1301146