DocumentCode
3782767
Title
Asymptotic analysis of temporal-difference learning algorithms with linear function approximation
Author
V. Tadic
Author_Institution
Mihajlo Pupin Inst., Belgrade, Serbia
Volume
5
fYear
1999
Firstpage
5050
Abstract
The asymptotic properties of temporal-difference learning algorithms with linear function approximation are analyzed in the paper. The analysis is carried out in the context of the approximation of a discounted cost-to-go function associated to an uncontrolled Markov chain with an uncountable finite-dimensional state-space.
Keywords
"Algorithm design and analysis","Approximation algorithms","Function approximation","Convergence","Difference equations","Random variables"
Publisher
ieee
Conference_Titel
Decision and Control, 1999. Proceedings of the 38th IEEE Conference on
ISSN
0191-2216
Print_ISBN
0-7803-5250-5
Type
conf
DOI
10.1109/CDC.1999.833350
Filename
833350
Link To Document