Title :
Online support vector regression based actor-critic method
Author :
Lee, Dong-Hyun ; Kim, Jeong-Jung ; Lee, Ju-Jang
Author_Institution :
Robot. Program, KAIST, Daejeon, South Korea
Abstract :
This paper proposes a new algorithm for actor-critic method using online support vector regression(SVR), which can do incremental learning and automatically track variation of environment with time-varying characteristics. It gives good generalization properties to value function approximation and helps the critic converge fast. In addition, sample vectors in data set of the online SVR are used as center positions of actor´s basis functions. Actor updates policy parameters with those functions using policy gradient algorithm. Throughout simulations, the feasibility and usefulness of the proposed method is demonstrated by comparison with other methods.
Keywords :
function approximation; generalisation (artificial intelligence); gradient methods; learning (artificial intelligence); support vector machines; actor basis functions; automatically track variation; generalization properties; incremental learning; online SVR; online support vector regression; policy gradient algorithm; time varying characteristics; value function approximation; Approximation algorithms; Equations; Function approximation; Learning; Mathematical model; Support vector machines;
Conference_Titel :
IECON 2010 - 36th Annual Conference on IEEE Industrial Electronics Society
Conference_Location :
Glendale, AZ
Print_ISBN :
978-1-4244-5225-5
Electronic_ISBN :
1553-572X
DOI :
10.1109/IECON.2010.5675206