Title :
Approximate Dynamic Programming Based on Expansive Projections
Author :
Arruda, Edilson F. ; Val, João B R do
Author_Institution :
Center for Syst. & Control, National Lab. for Sci. Comput., Petropolis
Abstract :
We present a general method to obtain convergent approximate value iteration algorithms with function approximation. The result applies to any approximation architecture and generalizes existing results in the literature that were derived for particular approximation schemes. Additionally, we show how to obtain a convergent approximate mapping whose fixed point is the projection onto the approximation space of a fixed point of the exact dynamic programming mapping, with respect to a suitable subset norm. This result relies on evaluating the difference between successive iterates in the selected subset norm, which yields convergent procedures for any approximation architecture.
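As an illustration of the setting the abstract describes, the following is a minimal sketch of generic approximate value iteration with a projection step, where the difference between successive iterates is monitored in a norm restricted to a subset of states. It is not the authors' algorithm: the function name `approximate_value_iteration`, the least-squares projection, the max-norm over `subset`, the reward-maximizing Bellman backup, and the toy MDP are all assumptions made for this example. State aggregation is used as the architecture because its least-squares projection is an averaging operator, so the iteration is expected to converge in this particular case; that need not hold for other architectures.

```python
import numpy as np

def approximate_value_iteration(P, R, gamma, Phi, subset, tol=1e-8, max_iter=1000):
    """Generic projected value iteration (illustrative sketch, not the paper's method).

    P: (A, S, S) transition probabilities, R: (A, S) one-step rewards,
    Phi: (S, K) feature matrix, subset: indices of states used to measure
    the change between successive iterates.
    """
    w = np.zeros(Phi.shape[1])
    diff = np.inf
    for k in range(max_iter):
        V = Phi @ w                                  # current approximate value function
        TV = np.max(R + gamma * (P @ V), axis=0)     # exact Bellman backup applied to V
        # Projection onto span(Phi) in the least-squares sense (an assumption here).
        w_new, *_ = np.linalg.lstsq(Phi, TV, rcond=None)
        # Change between successive iterates, measured only on the chosen subset.
        diff = np.max(np.abs((Phi @ w_new - V)[subset]))
        w = w_new
        if diff < tol:
            return w, k + 1, diff
    return w, max_iter, diff

# Hypothetical toy problem: 4 states, 2 actions, random dynamics and rewards.
rng = np.random.default_rng(0)
P = rng.random((2, 4, 4))
P /= P.sum(axis=2, keepdims=True)                    # make each row a distribution
R = rng.random((2, 4))
gamma = 0.9

# State aggregation: states {0, 1} share one weight, states {2, 3} another.
Phi = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
subset = [0, 2]                                      # one representative per aggregate

w, iterations, diff = approximate_value_iteration(P, R, gamma, Phi, subset)
```

Because the approximate value function is constant on each aggregate, measuring the change on one representative state per aggregate gives the same value as the max-norm over all states in this example.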
Keywords :
approximation theory; convergence; dynamic programming; function approximation; iterative methods; approximate dynamic programming; convergent approximate value iteration algorithms; expansive projections; Approximation algorithms; Computer architecture; Heuristic algorithms; Large-scale systems; Monitoring; State-space methods; USA Councils
Conference_Title :
2006 45th IEEE Conference on Decision and Control
Conference_Location :
San Diego, CA
Print_ISBN :
1-4244-0171-2
DOI :
10.1109/CDC.2006.376823