• DocumentCode
    3618529
  • Title

    On the robustness of two time-scale stochastic approximation algorithms

  • Author

    V.B. Tadic

  • Author_Institution
    Dept. of Autom. Control & Syst. Eng., Univ. of Sheffield, UK
  • Volume
    5
  • fYear
    2004
  • fDate
    6/26/1905 12:00:00 AM
  • Firstpage
    5334
  • Abstract
    Motivated by the problem of the asymptotic behavior of a class of actor-critic learning algorithms recently proposed by Konda and Tsitsiklis, the almost sure asymptotic properties of two time-scale stochastic approximation are analyzed under violated Kushner-Clark noise conditions and very general stability conditions. The analysis covers the algorithms with additive noise, as well as those with nonadditive noise. The algorithms with additive noise are analyzed for the case where the noise is state-dependent. The analysis of the algorithms with non-additive state-dependent noise is carried out for the case where the noise is a Markov chain controlled by the algorithm states, while the algorithms with non-additive exogenous noise are analyzed for the case where the noise is correlated and satisfies uniform or strong mixing conditions. The obtained results characterize the robustness of two time-scale stochastic approximation towards the violation of the Kushner-Clark noise condition. Moreover, they cover a fairly broad class of highly non-linear two time-scale stochastic approximation algorithms, including the actor-critic learning algorithms proposed by Konda and Tsitsiklis.
  • Keywords
    "Stochastic processes","Approximation algorithms","Algorithm design and analysis","Stochastic resonance","Additive noise","Noise robustness","Stochastic systems","Stability analysis","Asymptotic stability","Lyapunov method"
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2004. CDC. 43rd IEEE Conference on
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-8682-5
  • Type

    conf

  • DOI
    10.1109/CDC.2004.1429656
  • Filename
    1429656