• DocumentCode
    2011696
  • Title

    Reinforcement learning neurocontroller applied to a 2-DOF manipulator

  • Author

    Perez-Cisneros, M.A. ; Leal-Ascencio, Raúl ; Cook, P.A.

  • Author_Institution
    Control Syst. Centre, Univ. of Manchester Inst. of Sci. & Technol., UK
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    56
  • Lastpage
    61
  • Abstract
    This paper describes the capabilities of a reinforcement learning (RL) algorithm which uses two neural net structures to produce a direct inverse neurocontrol scheme. The PendubotTM a double inverted pendulum which is a nonlinear dynamic system inherently unstable, is used as benchmark plant because it could be attractive for testing control schemes. The RL neurocontroller is a learning system which consists of two connectionist nets, the action net (AN) and the evaluation net (EN). The action net generates the system´s behavior and the evaluation net learns an evaluation function of Pendubot´s states. A zero magnitude force is not permitted and the neurocontroller always is supplying a control signal for PendubotTM. The paper also describes a neurocontroller feature consisting of a nonlinear function added to the control signal when the second link approaches critic states. Results from training and operation stage are summarized for both neurocontrollers. Finally, the mass of link 2 of PendubotTM is altered increasing and decreasing its magnitude in order to observe the generalization capabilities in the neurocontroller. This last experiment is also documented
  • Keywords
    generalisation (artificial intelligence); learning (artificial intelligence); manipulators; neurocontrollers; nonlinear control systems; nonlinear dynamical systems; pendulums; stability; 2-DOF manipulator; AN; EN; Pendubot; RL algorithm; action network; direct inverse neurocontrol scheme; double inverted pendulum; evaluation network; inherently unstable nonlinear dynamic system; nonlinear function; reinforcement learning neurocontroller; Artificial neural networks; Benchmark testing; Control systems; Learning systems; Neural networks; Neurocontrollers; Nonlinear control systems; Nonlinear dynamical systems; Shafts; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control, 2001. (ISIC '01). Proceedings of the 2001 IEEE International Symposium on
  • Conference_Location
    Mexico City
  • ISSN
    2158-9860
  • Print_ISBN
    0-7803-6722-7
  • Type

    conf

  • DOI
    10.1109/ISIC.2001.971484
  • Filename
    971484