Title :
On the robustness of two time-scale stochastic approximation algorithms
Author_Institution :
Dept. of Autom. Control & Syst. Eng., Univ. of Sheffield, UK
fDate :
6/26/1905 12:00:00 AM
Abstract :
Motivated by the problem of the asymptotic behavior of a class of actor-critic learning algorithms recently proposed by Konda and Tsitsiklis, the almost sure asymptotic properties of two time-scale stochastic approximation are analyzed under violated Kushner-Clark noise conditions and very general stability conditions. The analysis covers the algorithms with additive noise, as well as those with nonadditive noise. The algorithms with additive noise are analyzed for the case where the noise is state-dependent. The analysis of the algorithms with non-additive state-dependent noise is carried out for the case where the noise is a Markov chain controlled by the algorithm states, while the algorithms with non-additive exogenous noise are analyzed for the case where the noise is correlated and satisfies uniform or strong mixing conditions. The obtained results characterize the robustness of two time-scale stochastic approximation towards the violation of the Kushner-Clark noise condition. Moreover, they cover a fairly broad class of highly non-linear two time-scale stochastic approximation algorithms, including the actor-critic learning algorithms proposed by Konda and Tsitsiklis.
Keywords :
"Stochastic processes","Approximation algorithms","Algorithm design and analysis","Stochastic resonance","Additive noise","Noise robustness","Stochastic systems","Stability analysis","Asymptotic stability","Lyapunov method"
Conference_Titel :
Decision and Control, 2004. CDC. 43rd IEEE Conference on
Print_ISBN :
0-7803-8682-5
DOI :
10.1109/CDC.2004.1429656