DocumentCode :
1874019
Title :
Temporal difference learning with interpolated table value functions
Author :
Lucas, Simon M.
Author_Institution :
Sch. of Comput. Sci. & Electron. Eng., Univ. of Essex, Colchester, UK
fYear :
2009
fDate :
7-10 Sept. 2009
Firstpage :
32
Lastpage :
37
Abstract :
This paper introduces a novel function approximation architecture especially well suited to temporal difference learning. The architecture is based on using sets of interpolated table look-up functions. These offer rapid and stable learning, and are efficient when the number of inputs is small. An empirical investigation is conducted to test their performance on a supervised learning task, and on the mountain car problem, a standard reinforcement learning benchmark. In each case, the interpolated table functions offer competitive performance.
Keywords :
interpolation; learning (artificial intelligence); function approximation architecture; interpolated table look-up function; interpolated table value function; mountain car problem; reinforcement learning benchmark; stable learning; supervised learning task; temporal difference learning; Benchmark testing; Computer architecture; Computer science; Counting circuits; Function approximation; Games; Multilayer perceptrons; Sampling methods; Supervised learning; Table lookup;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Games, 2009. CIG 2009. IEEE Symposium on
Conference_Location :
Milano
Print_ISBN :
978-1-4244-4814-2
Electronic_ISBN :
978-1-4244-4815-9
Type :
conf
DOI :
10.1109/CIG.2009.5286496
Filename :
5286496
Link To Document :
بازگشت