مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning mixed behaviours with parallel Q-learning

DocumentCode :

382894

Title :

Learning mixed behaviours with parallel Q-learning

Author :

Laurent, Guillaume J. ; Piat, Emmanuel

Author_Institution :

Lab. d´´Automatique de Besancon, CNRS, Besancon, France

Volume :

fYear :

2002

fDate :

2002

Firstpage :

1002

Abstract :

This paper presents a reinforcement learning algorithm based on a parallel approach of the Watkins´s Q-learning. This algorithm is used to control a two axis micro-manipulator system. The aim is to learn complex behaviour such as reaching target positions and avoiding obstacles at the same time. The simulations and the tests with the real manipulator show that this algorithm is able to learn simultaneously opposite behaviours and that it generates interesting action policies with regard to global path optimization.

Keywords :

collision avoidance; control system synthesis; learning (artificial intelligence); micromanipulators; Watkins Q-learning; action policies; complex behaviour learning; global path optimization; obstacle avoidance; parallel Q-learning; reinforcement learning algorithm; simultaneously opposite behaviours; target positions; two axis micro-manipulator system; Automatic generation control; Biological cells; Control systems; Friction; Glass; Humans; Hysteresis; Learning; Magnetic fields; Microscopy;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems, 2002. IEEE/RSJ International Conference on

Print_ISBN :

0-7803-7398-7

Type :

conf

DOI :

10.1109/IRDS.2002.1041521

Filename :

1041521

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=382894