DocumentCode
2851639
Title
Reinforcement learning method based on semi-parametric regression model
Author
Cheng, Yuhu ; Wang, Xuesong ; Tian, Xilan
Author_Institution
Sch. of Inf. & Electr. Eng., China Univ. of Min. & Technol., Xuzhou, China
fYear
2010
fDate
26-28 May 2010
Firstpage
11
Lastpage
15
Abstract
In order to make full use of the advantages of both parametric and non-parametric models simultaneously, a kind of semi-parametric support vector machine (SVM) was proposed by combining a non-parametric SVM model and a parametric linear basis function model. The semi-parametric SVM was used to estimate the Q values of continuous-state-discontinuous-action pairs in an on-line manner so as to generalize a standard Q learning method to continuous state spaces. Simulation results concerning the balancing control problem of an inverted pendulum show that the proposed Q learning method has good adaptability for changes of system parameters and initial states, which provides a new approach to solve the generalization problem of continuous space of reinforcement learning.
Keywords
learning (artificial intelligence); regression analysis; support vector machines; Q learning method; SVM; continuous-state-discontinuous-action pairs; inverted pendulum; parametric linear basis function model; reinforcement learning method; semiparametric regression model; support vector machine; Convergence; Electronic mail; Fuzzy logic; Learning systems; Neural networks; Optimization methods; Parametric statistics; State estimation; State-space methods; Support vector machines; Regression model; Reinforcement learning; Semi-parametric; Support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Control and Decision Conference (CCDC), 2010 Chinese
Conference_Location
Xuzhou
Print_ISBN
978-1-4244-5181-4
Electronic_ISBN
978-1-4244-5182-1
Type
conf
DOI
10.1109/CCDC.2010.5499145
Filename
5499145
Link To Document