DocumentCode :
1846327
Title :
Online support vector regression based actor-critic method
Author :
Lee, Dong-Hyun ; Kim, Jeong-Jung ; Lee, Ju-Jang
Author_Institution :
Robot. Program, KAIST, Daejeon, South Korea
fYear :
2010
fDate :
7-10 Nov. 2010
Firstpage :
193
Lastpage :
198
Abstract :
This paper proposes a new algorithm for actor-critic method using online support vector regression(SVR), which can do incremental learning and automatically track variation of environment with time-varying characteristics. It gives good generalization properties to value function approximation and helps the critic converge fast. In addition, sample vectors in data set of the online SVR are used as center positions of actor´s basis functions. Actor updates policy parameters with those functions using policy gradient algorithm. Throughout simulations, the feasibility and usefulness of the proposed method is demonstrated by comparison with other methods.
Keywords :
function approximation; generalisation (artificial intelligence); gradient methods; learning (artificial intelligence); support vector machines; actor basis functions; automatically track variation; generalization properties; incremental learning; online SVR; online support vector regression; policy gradient algorithm; time varying characteristics; value function approximation; Approximation algorithms; Equations; Function approximation; Learning; Mathematical model; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
IECON 2010 - 36th Annual Conference on IEEE Industrial Electronics Society
Conference_Location :
Glendale, AZ
ISSN :
1553-572X
Print_ISBN :
978-1-4244-5225-5
Electronic_ISBN :
1553-572X
Type :
conf
DOI :
10.1109/IECON.2010.5675206
Filename :
5675206
Link To Document :
بازگشت