DocumentCode
2507519
Title
Multi-agent hierarchical architecture modeling kinematic chains employing continuous RL learning with fuzzified state space
Author
Karigiannis, John N. ; Tzafestas, Costas S.
Author_Institution
Div. of Signals, Nat. Tech. Univ. of Athens (NTUA), Athens
fYear
2008
fDate
19-22 Oct. 2008
Firstpage
716
Lastpage
723
Abstract
In the context of multi-agent systems, we are proposing a hierarchical robot control architecture that comprises artificial intelligence (AI) techniques and traditional control methodologies, based on the realization of a learning team of agents in a continuous problem setting. In a multi-agent system, action selection is important for cooperation and coordination among the agents. By employing reinforcement learning (RL) methods in a fuzzified state-space, we accomplish to design a control architecture and a corresponding methodology, engaged in a continuous space, which enables the agents to learn, over a period of time, to perform sequences of continuous actions in a cooperative manner, in order to reach their goal without any prior generated task model. By organizing the agents in a nested architecture, as proposed in this work, a type of problem-specific recursive knowledge acquisition is attempted. Furthermore, the agents try to exploit the knowledge gathered in order to be in position to execute tasks that indicate certain degree of similarity. The agents correspond in fact to independent degrees of freedom of the system, and achieve to gain experience over the task that they collaboratively perform, by exploring and exploiting their state-to-action mapping space. A numerical experiment is presented in this paper, performed on a simulated planar 4 degrees of freedom (DOF) manipulator, in order to evaluate both the proposed hierarchical multi-agent architecture as well as the proposed methodological framework. It is anticipated that such an approach can be highly scalable for the control of robotic systems that are kinematically more complex, comprising multiple DOFs and potentially redundancies in open or closed kinematic chains, particularly dexterous manipulators.
Keywords
fuzzy systems; knowledge acquisition; learning (artificial intelligence); manipulators; medical robotics; multi-agent systems; robot kinematics; artificial intelligence; continuous RL learning; dexterous manipulators; fuzzified state space; hierarchical robot control architecture; multiagent systems; recursive knowledge acquisition; reinforcement learning; robot kinematics; state-to-action mapping space; Artificial intelligence; Control systems; Kinematics; Knowledge acquisition; Learning; Manipulators; Multiagent systems; Organizing; Robot control; State-space methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Biomedical Robotics and Biomechatronics, 2008. BioRob 2008. 2nd IEEE RAS & EMBS International Conference on
Conference_Location
Scottsdale, AZ
Print_ISBN
978-1-4244-2882-3
Electronic_ISBN
978-1-4244-2883-0
Type
conf
DOI
10.1109/BIOROB.2008.4762862
Filename
4762862
Link To Document