• DocumentCode
    2248121
  • Title

    A Policy Grad Grad Grad Grad ient Reinforcement Learning Algorithm with Fuzzy Function Approximation

  • Author

    Gu, Dongbing ; Yang, Erfu

  • Author_Institution
    Dept. of Comput. Sci., Essex Univ., Colchester
  • fYear
    2004
  • fDate
    22-26 Aug. 2004
  • Firstpage
    936
  • Lastpage
    940
  • Abstract
    For complex systems, reinforcement learning has to be generalised from a discrete form to a continuous form due to large state or action spaces. In this paper, the generalisation of reinforcement learning to continuous state space is investigated by using a policy gradient approach. Fuzzy logic is used as a function approximation in the generalisation. To guarantee learning convergence, a policy approximator and a state action value approximator are employed for the reinforcement learning. Both of them are based on fuzzy logic. The convergence of the learning algorithm is justified
  • Keywords
    convergence; function approximation; fuzzy logic; generalisation (artificial intelligence); gradient methods; learning (artificial intelligence); continuous state space; fuzzy function approximation; fuzzy logic; policy approximator; policy gradient method; reinforcement learning algorithm; state action value approximator; Approximation algorithms; Convergence; Function approximation; Fuzzy logic; Gradient methods; Machine learning; Machine learning algorithms; Orbital robotics; Robot sensing systems; State-space methods; Reinforcement learning; fuzzy Q-learning; policy gradient method;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Biomimetics, 2004. ROBIO 2004. IEEE International Conference on
  • Conference_Location
    Shenyang
  • Print_ISBN
    0-7803-8614-8
  • Type

    conf

  • DOI
    10.1109/ROBIO.2004.1521910
  • Filename
    1521910