Title :
Actor-critic neural network reinforcement learning for walking control of a 5-link bipedal robot
Author :
Vaghei, Yasaman ; Ghanbari, Ahmad ; Noorani, Sayyed Mohammad Reza Sayyed
Author_Institution :
Sch. of Eng. Emerging Technol., Univ. of Tabriz, Tabriz, Iran
Abstract :
Today, researches on adaptive control have focused on bio-inspired learning techniques to deal with real-life applications. Reinforcement Learning (RL) is one of these major techniques, which has been widely used in robot control tasks recently. On the other hand, artificial neural networks are an accurate approximation tool in nonlinear robotic dynamic control tasks. In this paper, our main goal was to combine the advantages of the artificial neural networks and the RL to reduce the learning time length and enhance the control accuracy. Therefore, we have implemented one of the promising RL approaches, actor-critic RL to control the actuation torques of a planar five-link bipedal robot and retain the passive torso in the vertical position. Our control agent consists of two three-layered neural network units, known as the critic and the actor for learning prediction and learning control tasks. These units are synchronized by the temporal difference error, which implements the eligibility trace vector to assign credit or blame for the error. Moreover, since the neural networks are implemented in both of the actor and the critic sections, we have added a learning database to reduce the probability of inaccurate approximation of the nonlinear functions. Results of our presented control method reveal its perfect performance in stable walking control of the bipedal robot.
Keywords :
adaptive control; control engineering computing; function approximation; intelligent robots; learning (artificial intelligence); legged locomotion; neurocontrollers; probability; robot dynamics; stability; torque control; vectors; 5-link bipedal robot; actor-critic RL; actor-critic neural network reinforcement learning; actuation torque control; adaptive control; approximation tool; artificial neural networks; bioinspired learning techniques; control agent; eligibility trace vector; learning control tasks; learning database; learning prediction; nonlinear function approximation; nonlinear robotic dynamic control tasks; passive torso; planar five-link bipedal robot; probability; robot control tasks; temporal difference error; three-layered neural network units; walking control stability; Artificial neural networks; Biological neural networks; Learning (artificial intelligence); Legged locomotion; Torso; biped robot; cognitive robot; neural network reinforcement learning; stable walking;
Conference_Titel :
Robotics and Mechatronics (ICRoM), 2014 Second RSI/ISM International Conference on
Conference_Location :
Tehran
DOI :
10.1109/ICRoM.2014.6990997