مرکز منطقه ای اطلاع رساني علوم و فناوري - Distributed asynchronous policy iteration in dynamic programming

DocumentCode :

2423132

Title :

Distributed asynchronous policy iteration in dynamic programming

Author :

Bertsekas, Dimitri P. ; Yu, Huizhen

Author_Institution :

Dept. of Electr. Eng. & Comp., M.I.T., Cambridge, MA, USA

fYear :

2010

fDate :

Sept. 29 2010-Oct. 1 2010

Firstpage :

1368

Lastpage :

1375

Abstract :

We consider the distributed solution of dynamic programming (DP) problems by policy iteration. We envision a network of processors, each updating asynchronously a local policy and a local cost function, defined on a portion of the state space. The computed values are communicated asynchronously between processors and are used to perform the local policy and cost updates. The natural algorithm of this type can fail even under favorable circumstances, as shown by Williams and Baird [WiB93]. We propose an alternative and almost as simple algorithm, which converges to the optimum under the most general conditions, including asynchronous updating by multiple processors using outdated local cost functions of other processors.

Keywords :

dynamic programming; multiprocessing systems; parallel processing; asynchronous updating; distributed asynchronous policy iteration; dynamic programming; local cost function; multiple processors; processors network; state space; Convergence; Cost function; Delay; Equations; Mathematical model; Program processors; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on

Conference_Location :

Allerton, IL

Print_ISBN :

978-1-4244-8215-3

Type :

conf

DOI :

10.1109/ALLERTON.2010.5707073

Filename :

5707073

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2423132