Title :
Reinforcement learning of sensor-based reaching strategies for a two-link manipulator
Author :
Martin, Pedro ; Millán, José Del R
Author_Institution :
Dept. of Comput. Sci., Jaume I Univ., Castellon, Spain
Abstract :
This paper presents a neural controller that learns goal-oriented obstacle-avoiding reaction strategies for a multilink robot arm. It acquires these strategies through reinforcement learning from local sensory data. The robot arm has rings of range sensors placed along its links. The neural controller achieves a good performance quite rapidly and shows good generalization abilities in the face of new environments. Suitable input and output codification schemes help greatly to attain these aims. The input codification exploits the inherent symmetry of the robot kinematics and the action given by the controller is interpreted with regard to the shortest path vector (SPV) to the closest goal in the configuration space. In order to avoid the SPV computation for multilink manipulators, we put forward the use of a module for differential inverse kinematics based on the inversion of a neural network that has been previously trained to approximate the manipulator forward kinematics. The use of this module does not only get round the SPV calculation, but also speeds up the learning process
Keywords :
generalisation (artificial intelligence); intelligent control; learning (artificial intelligence); manipulator kinematics; neurocontrollers; path planning; robot programming; generalization abilities; goal-oriented obstacle-avoiding reaction strategies; multilink robot arm; neural controller; reinforcement learning; robot kinematics; sensor-based reaching strategies; shortest path vector; two-link manipulator; Artificial neural networks; Computer networks; Computer science; Control systems; Learning; Manipulators; Orbital robotics; Robot kinematics; Robot sensing systems; Robotics and automation;
Conference_Titel :
Intelligent Robots and Systems '96, IROS 96, Proceedings of the 1996 IEEE/RSJ International Conference on
Conference_Location :
Osaka
Print_ISBN :
0-7803-3213-X
DOI :
10.1109/IROS.1996.568991