• DocumentCode
    2851639
  • Title

    Reinforcement learning method based on semi-parametric regression model

  • Author

    Cheng, Yuhu ; Wang, Xuesong ; Tian, Xilan

  • Author_Institution
    Sch. of Inf. & Electr. Eng., China Univ. of Min. & Technol., Xuzhou, China
  • fYear
    2010
  • fDate
    26-28 May 2010
  • Firstpage
    11
  • Lastpage
    15
  • Abstract
    In order to make full use of the advantages of both parametric and non-parametric models simultaneously, a kind of semi-parametric support vector machine (SVM) was proposed by combining a non-parametric SVM model and a parametric linear basis function model. The semi-parametric SVM was used to estimate the Q values of continuous-state-discontinuous-action pairs in an on-line manner so as to generalize a standard Q learning method to continuous state spaces. Simulation results concerning the balancing control problem of an inverted pendulum show that the proposed Q learning method has good adaptability for changes of system parameters and initial states, which provides a new approach to solve the generalization problem of continuous space of reinforcement learning.
  • Keywords
    learning (artificial intelligence); regression analysis; support vector machines; Q learning method; SVM; continuous-state-discontinuous-action pairs; inverted pendulum; parametric linear basis function model; reinforcement learning method; semiparametric regression model; support vector machine; Convergence; Electronic mail; Fuzzy logic; Learning systems; Neural networks; Optimization methods; Parametric statistics; State estimation; State-space methods; Support vector machines; Regression model; Reinforcement learning; Semi-parametric; Support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control and Decision Conference (CCDC), 2010 Chinese
  • Conference_Location
    Xuzhou
  • Print_ISBN
    978-1-4244-5181-4
  • Electronic_ISBN
    978-1-4244-5182-1
  • Type

    conf

  • DOI
    10.1109/CCDC.2010.5499145
  • Filename
    5499145