DocumentCode :
382894
Title :
Learning mixed behaviours with parallel Q-learning
Author :
Laurent, Guillaume J. ; Piat, Emmanuel
Author_Institution :
Lab. d´´Automatique de Besancon, CNRS, Besancon, France
Volume :
1
fYear :
2002
fDate :
2002
Firstpage :
1002
Abstract :
This paper presents a reinforcement learning algorithm based on a parallel approach of the Watkins´s Q-learning. This algorithm is used to control a two axis micro-manipulator system. The aim is to learn complex behaviour such as reaching target positions and avoiding obstacles at the same time. The simulations and the tests with the real manipulator show that this algorithm is able to learn simultaneously opposite behaviours and that it generates interesting action policies with regard to global path optimization.
Keywords :
collision avoidance; control system synthesis; learning (artificial intelligence); micromanipulators; Watkins Q-learning; action policies; complex behaviour learning; global path optimization; obstacle avoidance; parallel Q-learning; reinforcement learning algorithm; simultaneously opposite behaviours; target positions; two axis micro-manipulator system; Automatic generation control; Biological cells; Control systems; Friction; Glass; Humans; Hysteresis; Learning; Magnetic fields; Microscopy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2002. IEEE/RSJ International Conference on
Print_ISBN :
0-7803-7398-7
Type :
conf
DOI :
10.1109/IRDS.2002.1041521
Filename :
1041521
Link To Document :
بازگشت