DocumentCode :
716566
Title :
Beyond lowest-warping cost action selection in trajectory transfer
Author :
Hadfield-Menell, Dylan ; Lee, Alex X. ; Finn, Chelsea ; Tzeng, Eric ; Huang, Sandy ; Abbeel, Pieter
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of California at Berkeley, Berkeley, CA, USA
fYear :
2015
fDate :
26-30 May 2015
Firstpage :
3231
Lastpage :
3238
Abstract :
We consider the problem of learning from demonstrations to manipulate deformable objects. Recent work [1], [2], [3] has shown promising results that enable robotic manipulation of deformable objects through learning from demonstrations. Their approach is able to generalize from a single demonstration to new test situations, and suggests a nearest neighbor approach to select a demonstration to adapt to a given test situation. Such a nearest neighbor approach, however, ignores important aspects of the problem: brittleness (versus robustness) of demonstrations when generalized through this process, and the extent to which a demonstration makes progress towards a goal. In this paper, we frame the problem of selecting which demonstration to transfer as an options Markov decision process (MDP). We present max-margin Q-function estimation: an approach to learn a Q-function from expert demonstrations. Our learned policies account for variability in robustness of demonstrations and the sequential nature of our tasks. We developed two knot-tying benchmarks to experimentally validate the effectiveness of our proposed approach. The selection strategy described in [2] achieves success rates of 70% and 54%, respectively. Our approach performs significantly better, with success rates of 88% and 76%, respectively.
Keywords :
Markov processes; end effectors; learning by example; Markov decision process; end-effector; learning from demonstrations; max-margin Q-function estimation; trajectory transfer; warping cost action selection; Libraries; Mathematical model; Optimization; Robots; Robustness; Three-dimensional displays; Trajectory;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Robotics and Automation (ICRA), 2015 IEEE International Conference on
Conference_Location :
Seattle, WA
Type :
conf
DOI :
10.1109/ICRA.2015.7139644
Filename :
7139644