Title :
Learning mixed behaviours with parallel Q-learning
Author :
Laurent, Guillaume J. ; Piat, Emmanuel
Author_Institution :
Lab. d´´Automatique de Besancon, CNRS, Besancon, France
Abstract :
This paper presents a reinforcement learning algorithm based on a parallel approach of the Watkins´s Q-learning. This algorithm is used to control a two axis micro-manipulator system. The aim is to learn complex behaviour such as reaching target positions and avoiding obstacles at the same time. The simulations and the tests with the real manipulator show that this algorithm is able to learn simultaneously opposite behaviours and that it generates interesting action policies with regard to global path optimization.
Keywords :
collision avoidance; control system synthesis; learning (artificial intelligence); micromanipulators; Watkins Q-learning; action policies; complex behaviour learning; global path optimization; obstacle avoidance; parallel Q-learning; reinforcement learning algorithm; simultaneously opposite behaviours; target positions; two axis micro-manipulator system; Automatic generation control; Biological cells; Control systems; Friction; Glass; Humans; Hysteresis; Learning; Magnetic fields; Microscopy;
Conference_Titel :
Intelligent Robots and Systems, 2002. IEEE/RSJ International Conference on
Print_ISBN :
0-7803-7398-7
DOI :
10.1109/IRDS.2002.1041521