Title :
Reinforcement learning algorithm for industrial robot programming by demonstration
Author :
Stoica, M. ; Sisak, F. ; Morosan, A.D.
Author_Institution :
Automatics Dept., Transilvania Univ. of Brasov, Brasov, Romania
Abstract :
Programming by demonstration is a significant subject in robotics, and it is developing increasingly in the direction of service robots and humanoid robots. It is much less researched for industrial robots. One reason is that an industrial robot has to act in a precise and predictable manner. However, extending research on programming by demonstration to industrial robots could lead to the development of intelligent systems in which the industrial robot can be programmed more easily. In this paper we propose an algorithm based on reinforcement learning, and we develop, implement, and test this algorithm, which can offer flexibility in intelligent systems. Initially, we focused our research on creating a reasoning algorithm based on artificial neural networks, but its results were not satisfactory, so we switched our focus to the proposed algorithm. The result of this algorithm is that the robot will be able to learn from its mistakes and will know how to act in unknown situations; this is possible because the robot receives marks for each possible action and updates its behavior accordingly.
Keywords :
automatic programming; humanoid robots; industrial robots; knowledge based systems; learning (artificial intelligence); robot programming; service robots; humanoid robot; industrial robot; intelligent system; programming by demonstration; reinforcement learning algorithm; service robot; Classification algorithms; Learning; Programming; Robot kinematics; Service robots; Software;
Conference_Title :
Optimization of Electrical and Electronic Equipment (OPTIM), 2012 13th International Conference on
Conference_Location :
Brasov
Print_ISBN :
978-1-4673-1650-7
Electronic_ISBN :
1842-0133
DOI :
10.1109/OPTIM.2012.6231926