Title :
Trajectory-model-based reinforcement learning: Application to bimanual humanoid motor learning with a closed-chain constraint
Author :
Sugimoto, Norikazu ; Morimoto, Jun
Author_Institution :
Dept. of Brain Machine Interface, Nat. Inst. of Inf. & Commun. Technol., Suita, Japan
Abstract :
We propose a reinforcement learning (RL) framework to improve policies for a high-dimensional system through fewer interactions with real environments than standard RL methods. In our learning framework, we first use off-line simulations to improve the controller parameters with an approximated environment model to generate samples along locally optimized trajectories. We then use the approximated dynamics to improve the performance of a tool manipulation task in a path integral RL framework, which updates a policy from the sampled trajectories of the state and action vectors and the cost. In this study, we apply our proposed method to a bimanual humanoid motor learning task in which we need to explicitly consider a closed-chain constraint. We show that a 51-DOF real humanoid robot can learn to manipulate a rod to hit via-points using both arms within 36 interactions in a real environment.
Keywords :
control engineering computing; humanoid robots; learning (artificial intelligence); trajectory control; vectors; 51-DOF real humanoid robot; bimanual humanoid motor learning; closed-chain constraint; high-dimensional system; path integral RL framework; reinforcement learning; tool manipulation task; trajectory-model; vector; Humanoid robots; Joints; Predictive models; Standards; Trajectory; Vectors;
Conference_Titel :
Humanoid Robots (Humanoids), 2013 13th IEEE-RAS International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4799-2617-6
DOI :
10.1109/HUMANOIDS.2013.7030010