مرکز منطقه ای اطلاع رساني علوم و فناوري - Reinforcement learning method based on semi-parametric regression model

DocumentCode :

2851639

Title :

Reinforcement learning method based on semi-parametric regression model

Author :

Cheng, Yuhu ; Wang, Xuesong ; Tian, Xilan

Author_Institution :

Sch. of Inf. & Electr. Eng., China Univ. of Min. & Technol., Xuzhou, China

fYear :

2010

fDate :

26-28 May 2010

Firstpage :

Lastpage :

Abstract :

In order to make full use of the advantages of both parametric and non-parametric models simultaneously, a kind of semi-parametric support vector machine (SVM) was proposed by combining a non-parametric SVM model and a parametric linear basis function model. The semi-parametric SVM was used to estimate the Q values of continuous-state-discontinuous-action pairs in an on-line manner so as to generalize a standard Q learning method to continuous state spaces. Simulation results concerning the balancing control problem of an inverted pendulum show that the proposed Q learning method has good adaptability for changes of system parameters and initial states, which provides a new approach to solve the generalization problem of continuous space of reinforcement learning.

Keywords :

learning (artificial intelligence); regression analysis; support vector machines; Q learning method; SVM; continuous-state-discontinuous-action pairs; inverted pendulum; parametric linear basis function model; reinforcement learning method; semiparametric regression model; support vector machine; Convergence; Electronic mail; Fuzzy logic; Learning systems; Neural networks; Optimization methods; Parametric statistics; State estimation; State-space methods; Support vector machines; Regression model; Reinforcement learning; Semi-parametric; Support vector machine;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Control and Decision Conference (CCDC), 2010 Chinese

Conference_Location :

Xuzhou

Print_ISBN :

978-1-4244-5181-4

Electronic_ISBN :

978-1-4244-5182-1

Type :

conf

DOI :

10.1109/CCDC.2010.5499145

Filename :

5499145

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2851639