• DocumentCode
    2423132
  • Title

    Distributed asynchronous policy iteration in dynamic programming

  • Author

    Bertsekas, Dimitri P. ; Yu, Huizhen

  • Author_Institution
    Dept. of Electr. Eng. & Comp., M.I.T., Cambridge, MA, USA
  • fYear
    2010
  • fDate
    Sept. 29 2010-Oct. 1 2010
  • Firstpage
    1368
  • Lastpage
    1375
  • Abstract
    We consider the distributed solution of dynamic programming (DP) problems by policy iteration. We envision a network of processors, each updating asynchronously a local policy and a local cost function, defined on a portion of the state space. The computed values are communicated asynchronously between processors and are used to perform the local policy and cost updates. The natural algorithm of this type can fail even under favorable circumstances, as shown by Williams and Baird [WiB93]. We propose an alternative and almost as simple algorithm, which converges to the optimum under the most general conditions, including asynchronous updating by multiple processors using outdated local cost functions of other processors.
  • Keywords
    dynamic programming; multiprocessing systems; parallel processing; asynchronous updating; distributed asynchronous policy iteration; dynamic programming; local cost function; multiple processors; processors network; state space; Convergence; Cost function; Delay; Equations; Mathematical model; Program processors; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on
  • Conference_Location
    Allerton, IL
  • Print_ISBN
    978-1-4244-8215-3
  • Type

    conf

  • DOI
    10.1109/ALLERTON.2010.5707073
  • Filename
    5707073