DocumentCode :
716094
Title :
Learning contact-rich manipulation skills with guided policy search
Author :
Levine, Sergey ; Wagener, Nolan ; Abbeel, Pieter
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of California, Berkeley, Berkeley, CA, USA
fYear :
2015
fDate :
26-30 May 2015
Firstpage :
156
Lastpage :
163
Abstract :
Autonomous learning of object manipulation skills can enable robots to acquire rich behavioral repertoires that scale to the variety of objects found in the real world. However, current motion skill learning methods typically restrict the behavior to a compact, low-dimensional representation, limiting its expressiveness and generality. In this paper, we extend a recently developed policy search method [1] and use it to learn a range of dynamic manipulation behaviors with highly general policy representations, without using known models or example demonstrations. Our approach learns a set of trajectories for the desired motion skill by using iteratively refitted time-varying linear models, and then unifies these trajectories into a single control policy that can generalize to new situations. To enable this method to run on a real robot, we introduce several improvements that reduce the sample count and automate parameter selection. We show that our method can acquire fast, fluent behaviors after only minutes of interaction time, and can learn robust controllers for complex tasks, including putting together a toy airplane, stacking tight-fitting lego blocks, placing wooden rings onto tight-fitting pegs, inserting a shoe tree into a shoe, and screwing bottle caps onto bottles.
Keywords :
learning systems; linear systems; robots; robust control; search problems; time-varying systems; autonomous object manipulation skill learning; compact low-dimensional representation; contact-rich manipulation skill learning; dynamic manipulation behaviors; guided policy search method; iterativel refitted time-varying linear models; learn robust controllers; motion skill learning methods; parameter selection automation; sample count reduction; single control policy; tight-fitting lego block stacking; toy airplane; Cost function; Heuristic algorithms; Neural networks; Robots; Training; Trajectory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/ICRA.2015.7138994
Filename :
7138994
Link To Document :
بازگشت