Title :
Reinforcement learning neurocontroller applied to a 2-DOF manipulator
Author :
Perez-Cisneros, M.A. ; Leal-Ascencio, Raúl ; Cook, P.A.
Author_Institution :
Control Syst. Centre, Univ. of Manchester Inst. of Sci. & Technol., UK
Abstract :
This paper describes the capabilities of a reinforcement learning (RL) algorithm which uses two neural net structures to produce a direct inverse neurocontrol scheme. The PendubotTM a double inverted pendulum which is a nonlinear dynamic system inherently unstable, is used as benchmark plant because it could be attractive for testing control schemes. The RL neurocontroller is a learning system which consists of two connectionist nets, the action net (AN) and the evaluation net (EN). The action net generates the system´s behavior and the evaluation net learns an evaluation function of Pendubot´s states. A zero magnitude force is not permitted and the neurocontroller always is supplying a control signal for PendubotTM. The paper also describes a neurocontroller feature consisting of a nonlinear function added to the control signal when the second link approaches critic states. Results from training and operation stage are summarized for both neurocontrollers. Finally, the mass of link 2 of PendubotTM is altered increasing and decreasing its magnitude in order to observe the generalization capabilities in the neurocontroller. This last experiment is also documented
Keywords :
generalisation (artificial intelligence); learning (artificial intelligence); manipulators; neurocontrollers; nonlinear control systems; nonlinear dynamical systems; pendulums; stability; 2-DOF manipulator; AN; EN; Pendubot; RL algorithm; action network; direct inverse neurocontrol scheme; double inverted pendulum; evaluation network; inherently unstable nonlinear dynamic system; nonlinear function; reinforcement learning neurocontroller; Artificial neural networks; Benchmark testing; Control systems; Learning systems; Neural networks; Neurocontrollers; Nonlinear control systems; Nonlinear dynamical systems; Shafts; System testing;
Conference_Titel :
Intelligent Control, 2001. (ISIC '01). Proceedings of the 2001 IEEE International Symposium on
Conference_Location :
Mexico City
Print_ISBN :
0-7803-6722-7
DOI :
10.1109/ISIC.2001.971484