DocumentCode :
3645983
Title :
Asymptotic bias of stochastic gradient search
Author :
Vladislav B. Tadić;A. Doucet
Author_Institution :
Department of Mathematics, University of Bristol, BS8 1TW, United Kingdom
fYear :
2011
Firstpage :
722
Lastpage :
727
Abstract :
The asymptotic behavior of the stochastic gradient algorithm with a biased gradient estimator is analyzed. Relying on arguments based on differential geometry (Yomdin theorem and Lojasiewicz inequality), relatively tight bounds on the asymptotic bias of the iterates generated by such an algorithm are derived. The obtained results hold under mild and verifiable conditions and cover a broad class of complex stochastic gradient algorithms. Using these results, the asymptotic properties of the actor-critic reinforcement learning are studied.
Keywords :
"Signal processing algorithms","Estimation","Approximation methods","Learning","Markov processes","Approximation algorithms"
Publisher :
ieee
Conference_Titel :
Decision and Control and European Control Conference (CDC-ECC), 2011 50th IEEE Conference on
ISSN :
0191-2216
Print_ISBN :
978-1-61284-800-6
Type :
conf
DOI :
10.1109/CDC.2011.6160812
Filename :
6160812
Link To Document :
بازگشت