DocumentCode :
2947227
Title :
New error bounds for approximations from projected linear equations
Author :
Yu, Huizhen ; Bertsekas, Dimitri P.
Author_Institution :
Dept. of Comput. Sci., Univ. of Helsinki, Helsinki
fYear :
2008
fDate :
23-26 Sept. 2008
Firstpage :
1116
Lastpage :
1123
Abstract :
We consider linear fixed point equations and their approximations by projection on a low dimensional subspace. We derive new bounds on the approximation error of the solution, which are expressed in terms of low dimensional matrices and can be computed by simulation. When the fixed point mapping is a contraction, as is typically the case in Markovian decision processes (MDP), one of our bounds is always sharper than the standard worst case bounds, and another one is often sharper. Our bounds also apply to the non-contraction case, including policy evaluation in MDP with nonstandard projections that enhance exploration. There are no error bounds currently available for this case to our knowledge.
Keywords :
Markov processes; approximation theory; matrix algebra; Markovian decision processes; approximation error; linear fixed point equations; projected linear equations; Approximation error; Computational modeling; Computer science; Cost function; Difference equations; Dynamic programming; Learning; Linear approximation; Stochastic processes; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communication, Control, and Computing, 2008 46th Annual Allerton Conference on
Conference_Location :
Urbana-Champaign, IL
Print_ISBN :
978-1-4244-2925-7
Electronic_ISBN :
978-1-4244-2926-4
Type :
conf
DOI :
10.1109/ALLERTON.2008.4797685
Filename :
4797685
Link To Document :
بازگشت