DocumentCode :
3618529
Title :
On the robustness of two time-scale stochastic approximation algorithms
Author :
V.B. Tadic
Author_Institution :
Dept. of Autom. Control & Syst. Eng., Univ. of Sheffield, UK
Volume :
5
fYear :
2004
fDate :
6/26/1905 12:00:00 AM
Firstpage :
5334
Abstract :
Motivated by the problem of the asymptotic behavior of a class of actor-critic learning algorithms recently proposed by Konda and Tsitsiklis, the almost sure asymptotic properties of two time-scale stochastic approximation are analyzed under violated Kushner-Clark noise conditions and very general stability conditions. The analysis covers the algorithms with additive noise, as well as those with nonadditive noise. The algorithms with additive noise are analyzed for the case where the noise is state-dependent. The analysis of the algorithms with non-additive state-dependent noise is carried out for the case where the noise is a Markov chain controlled by the algorithm states, while the algorithms with non-additive exogenous noise are analyzed for the case where the noise is correlated and satisfies uniform or strong mixing conditions. The obtained results characterize the robustness of two time-scale stochastic approximation towards the violation of the Kushner-Clark noise condition. Moreover, they cover a fairly broad class of highly non-linear two time-scale stochastic approximation algorithms, including the actor-critic learning algorithms proposed by Konda and Tsitsiklis.
Keywords :
"Stochastic processes","Approximation algorithms","Algorithm design and analysis","Stochastic resonance","Additive noise","Noise robustness","Stochastic systems","Stability analysis","Asymptotic stability","Lyapunov method"
Publisher :
ieee
Conference_Titel :
Decision and Control, 2004. CDC. 43rd IEEE Conference on
ISSN :
0191-2216
Print_ISBN :
0-7803-8682-5
Type :
conf
DOI :
10.1109/CDC.2004.1429656
Filename :
1429656
Link To Document :
بازگشت