DocumentCode :
1946553
Title :
Improved Simultaneous Perturbation Stochastic Approximation and Its Application in Reinforcement Learning
Author :
Yue, Xiumei
Author_Institution :
Dept. of Electr. & Electron. Eng., Huangshi Inst. of Technol., Huangshi
Volume :
1
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
329
Lastpage :
332
Abstract :
In the optimization problem which only measurements of the objective function are available, it is difficult or impossible to directly obtain the gradient of the objective function. Although the second order simultaneous perturbation stochastic approximation (2SPSA) algorithm solves this problem successfully by efficient gradient approximation that relies on measurements of the objective function, the accuracy of the algorithm depends on the matrix conditioning of the objective function Hessian. In order to eliminate the influence caused by the objective function Hessian, this paper uses nonlinear conjugate gradient method to decide the search direction of the objective function. By synthesizing different nonlinear conjugate gradient methods, it ensures each search direction to be descensive. Besides the search direction improvement, this paper also uses inexact line searches to decide the stepsize of movement. With the descensive search direction and appropriate stepsize, the improved SPSA converges faster than the 2SPSA. Through applying to reinforcement learning, the virtues of the improved SPSA are validated.
Keywords :
Hessian matrices; approximation theory; conjugate gradient methods; learning (artificial intelligence); stochastic processes; gradient approximation; matrix conditioning; nonlinear conjugate gradient method; objective function Hessian; reinforcement learning; simultaneous perturbation stochastic approximation algorithm; Acceleration; Application software; Approximation algorithms; Computer science; Convergence; Finite difference methods; Gradient methods; Learning; Software engineering; Stochastic processes; SPSA; nonlinear conjugate gradient method; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.1019
Filename :
4721754
Link To Document :
بازگشت