DocumentCode :
3706914
Title :
Temporal-Difference learning an online support vector regression approach
Author :
Hugo Tanzarella Teixeira;Celso Pascoli Bottura
Author_Institution :
State University of Campinas - UNICAMP, School of Electrical and Computer Engineering - FEEC, DSIF-LCSI, Av. Albert Einstein, N. 400 - LE31 - CEP 13081-970, SP, Brazil
Volume :
1
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
318
Lastpage :
323
Abstract :
This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties support vector regression (SVR) has, and also can do incremental learning and automatically track variation of environment with time-varying characteristics. Using the online SVR we can obtain good estimation of value function in TD learning in linear and nonlinear prediction problems. Experimental results demonstrate the effectiveness of the proposed method by comparison with others methods.
Keywords :
"Support vector machines","Approximation algorithms","Function approximation","Markov processes","Kernel","Prediction algorithms"
Publisher :
ieee
Conference_Titel :
Informatics in Control, Automation and Robotics (ICINCO), 2015 12th International Conference on
Type :
conf
Filename :
7350483
Link To Document :
بازگشت