Title of article
The Borkar–Meyn theorem for asynchronous stochastic approximations
Author/Authors
Bhatnagar، نويسنده , , Shalabh، نويسنده ,
Issue Information
ماهنامه با شماره پیاپی سال 2011
Pages
7
From page
472
To page
478
Abstract
In this paper, we give a generalization of a result by Borkar and Meyn (2000) [1], on the stability and convergence of synchronous-update stochastic approximation algorithms, to the case of asynchronous stochastic approximations with delays. We then describe an interesting application of the result to asynchronous distributed temporal difference (TD) learning with function approximation and delays.
Keywords
The Borkar–Meyn theorem , Asynchronous stochastic approximation with delays , Temporal difference learning
Journal title
Systems and Control Letters
Serial Year
2011
Journal title
Systems and Control Letters
Record number
1675751
Link To Document