Title :
Least-Squares SARSA(Lambda) Algorithms for Reinforcement Learning
Author :
Chen, Sheng-Lei ; Wei, Yan-Mei
Author_Institution :
Sch. of Inf. Sci., Nanjing Audit Univ., Nanjing
Abstract :
The problem of slow convergence speed and low efficiency of experience exploitation in SARSA(lambda) learning is analyzed. And then the least-squares approximation model of the state-action pair´s value function is constructed according to current and previous experiences. A set of linear equations is derived, which is satisfied by the weight vector of function approximator on a set of basis. Thus the fast and practical least-squares SARSA(lambda) algorithm and improved recursive algorithm are proposed. The experiment of inverted pendulum demonstrates that these algorithms can effectively improve convergence speed and the efficiency of experience exploitation.
Keywords :
function approximation; learning (artificial intelligence); least squares approximations; function approximator; inverted pendulum; least-squares SARSA(lambda) algorithms; linear equations; reinforcement learning; slow convergence speed; state-action pair value function; Algorithm design and analysis; Convergence; Equations; Information analysis; Information science; Information technology; Machine learning algorithms; Sampling methods; TV; Vectors;
Conference_Titel :
Natural Computation, 2008. ICNC '08. Fourth International Conference on
Conference_Location :
Jinan
Print_ISBN :
978-0-7695-3304-9
DOI :
10.1109/ICNC.2008.694