عنوان مقاله :
ForSts: Tacit Collusion in the Repeated Non-Cooperative Games Using Forwarding N-Steps Reinforcement Learning Algorithm
پديد آورندگان :
Golzari Hormozi ، Amin K. N. Toosi University of Technology - Faculty of Computer Engineering , Khasteh ، Hossein K. N. Toosi University of Technology - Faculty of Computer Engineering , Nikoofard ، Amir hossein K. N. Toosi University of Technology - Faculty of Electrical Engineering , Shirmohammadi ، Zahra Shahid Rajaee Teacher Training University - Faculty of Computer Engineering
كليدواژه :
Cournot , Electricity market , Nash equilibrium , Non , cooperative repeated games , Prisoner’s Dilemma , Reinforcement learning
چكيده فارسي :
In the game theory, the wellknown solution to obtain the best profit in nonrepeated games as much as possible is the Nash equilibrium. However, in some repeated noncooperative games, agents can achieve more profit than the Nash equilibrium by tacit collusion. One of the methods to achieve profit more than Nash equilibriums in tacit collusion is reinforcement learning. However, reinforcement learningbased methods consider only one step in the learning process. To achieve and improve profit in these games, more than one step can be used. In this regard, a learningbased forwarding Nsteps algorithm called Forwarding Steps (ForSts) is proposed in this paper. The main idea behind ForSts is to improve the performance of agents in noncooperative games by observing the last Nstep rewards. As ForSts is used in the game theory to learn tacit collusion, it is evaluated by the iterated prisoner’s dilemma and the Cournot market. Prisoner’s Dilemma is an example of a traditional game. The results show that in the iterated prisoner’s dilemma, the agents using ForSts achieve better profit than the agents playing in the Nash equilibrium. Also, in the Cournot electricity market, sum of the profit of agents using ForSts is 3.614% more than the sum of profit of agents` playing in the Nash equilibrium.
عنوان نشريه :
هوش محاسباتي در مهندسي برق
عنوان نشريه :
هوش محاسباتي در مهندسي برق