• DocumentCode
    2503222
  • Title

    Reinforcement learning method for continuous state space based on dynamic neural network

  • Author

    Sun, Wei ; Wang, Xuesong ; Cheng, Yuhu

  • Author_Institution
    Sch. of Inf. & Electr. Eng., China Univ. of Min. & Technol., Xuzhou
  • fYear
    2008
  • fDate
    25-27 June 2008
  • Firstpage
    750
  • Lastpage
    754
  • Abstract
    One of the difficulties encountered in the application of reinforcement learning methods to real-world problem is the generalization of large-scale or continuous state space. In order to solve the curse of dimensionality problem caused by discretizing continuous state space, a kind of Q-learning method for continuous state space based on a dynamic Elman neural network was proposed in this paper. The inputs and the output of Elman network are the system state-action pair and the corresponding Q-value. That is, Elman network is used to estimate the Q-value of state-action pair on-line. Eligibility trace for connecting weights is introduced by borrowing ideas from the eligibility trace mechanism of state in temporal difference algorithm to improve the learning speed of neural network. Computer simulations on mountain car control illustrate the performance and applicability of the proposed Q-learning scheme.
  • Keywords
    continuous systems; generalisation (artificial intelligence); large-scale systems; learning (artificial intelligence); neurocontrollers; state-space methods; Q-learning method; Q-value estimation; continuous state space generalization; dynamic Elman neural network; eligibility trace mechanism; large-scale system generalization; reinforcement learning control method; state-action pair; temporal difference algorithm; Animals; Control systems; Learning systems; Machine learning; Neural networks; Nonlinear dynamical systems; Nonlinear systems; State-space methods; Supervised learning; Unsupervised learning; Q-learning; dynamic neural network; generalization; reinforcement learning; state space;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
  • Conference_Location
    Chongqing
  • Print_ISBN
    978-1-4244-2113-8
  • Electronic_ISBN
    978-1-4244-2114-5
  • Type

    conf

  • DOI
    10.1109/WCICA.2008.4594438
  • Filename
    4594438