مرکز منطقه ای اطلاع رساني علوم و فناوري - Receding horizon approach to Markov games for infinite horizon discounted cost

DocumentCode :

391237

Title :

Receding horizon approach to Markov games for infinite horizon discounted cost

Author :

Chang, Hyeong Soo ; Marcus, Steven I.

Author_Institution :

Dept. & Inst. for Syst. Res., Maryland Univ., College Park, MD, USA

Volume :

fYear :

2002

fDate :

10-13 Dec. 2002

Firstpage :

1380

Abstract :

We consider a receding horizon approach as an approximate solution to two-person zero-sum Markov games with an infinite horizon discounted cost criterion. We first present error bounds from the optimal equilibrium value of the game when both players take "correlated" receding horizon policies that are based on exact or approximate solutions of receding finite horizon sub-games. Motivated by the worst-case optimal control of queueing systems by Altman (1995), we then analyze error bounds when the minimizer plays the (approximate) receding horizon control and the maximizer plays the worst case policy. We give two heuristic examples of the approximate receding horizon control. We extend "parallel rollout" and "hindsight optimization" by Chang et al. (2001, 2000) into the Markov game setting within the framework of the approximate receding horizon approach and analyze their performances. From the parallel rollout approach, the minimizing player seeks to combine dynamically multiple heuristic policies in a set to improve the performances of all of the heuristic policies simultaneously under the guess that the maximizing player has chosen a fixed worst-case policy. Given ε > 0, we give the value of the receding horizon which guarantees that the parallel rollout policy with the horizon played by the minimizer "dominates" any heuristic policy in the set by e. From the hindsight optimization approach, the minimizing player makes a decision based on his expected optimal hindsight performance over a finite horizon. We finally discuss practical implementations of the receding horizon approaches via simulation.

Keywords :

Markov processes; game theory; infinite horizon; decision making; error bounds; game theory; heuristic policy; multiple heuristic policies; optimal control; optimal equilibrium; parallel rollout; receding finite horizon; receding horizon; receding horizon control; two-person Markov games; zero-sum Markov games; Control systems; Costs; Educational institutions; Error analysis; Game theory; Infinite horizon; Optimal control; Postal services; Queueing analysis; Telecommunication control;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control, 2002, Proceedings of the 41st IEEE Conference on

ISSN :

0191-2216

Print_ISBN :

0-7803-7516-5

Type :

conf

DOI :

10.1109/CDC.2002.1184710

Filename :

1184710

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=391237