مرکز منطقه ای اطلاع رساني علوم و فناوري - Stochastic kernel temporal difference for reinforcement learning

DocumentCode :

2133543

Title :

Stochastic kernel temporal difference for reinforcement learning

Author :

Bae, Jihye ; Giraldo, Luis Sanchez ; Chhatbar, Pratik ; Francis, Joseph ; Sanchez, Justin ; Principe, Jose

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Florida, Gainesville, FL, USA

fYear :

2011

fDate :

18-21 Sept. 2011

Firstpage :

Lastpage :

Abstract :

This paper introduces a kernel adaptive filter using the stochastic gradient on temporal differences, kernel TD(λ), to estimate the state-action value function Q in reinforcement learning. Kernel methods are powerful for solving nonlinear problems, but the growing computational complexity and memory size limit their applicability on practical scenarios. To overcome this, the quantization approach introduced in [1] is applied. To help understand the behavior and illustrate the role of the parameters, we apply the algorithm on a 2-dimentional spatial navigation task. Eligibility traces are commonly applied in TD learning to improve data efficiency, so the relations of eligibility trace λ and step size and filter size are observed. Moreover, kernel TD (0) is applied to neural decoding of an 8 target center-out reaching task performed by a monkey. Results show the method can effectively learn the brain-state action mapping for this task.

Keywords :

computational complexity; gradient methods; learning (artificial intelligence); stochastic processes; brain state action mapping; computational complexity; data efficiency; kernel adaptive filter; neural decoding; nonlinear problems; reinforcement learning; state action value function; stochastic gradient; stochastic kernel temporal difference; temporal differences; Decoding; Kernel; Learning; Least squares approximation; Matched filters; Quantization; Signal processing algorithms; Temporal difference learning; adaptive filtering; kernel methods; reinforcement learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on

Conference_Location :

Santander

ISSN :

1551-2541

Print_ISBN :

978-1-4577-1621-8

Electronic_ISBN :

1551-2541

Type :

conf

DOI :

10.1109/MLSP.2011.6064634

Filename :

6064634

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2133543