Title :
Learning nonparametric policies by imitation
Author :
Grimes, David B. ; Rao, Rajesh P N
Author_Institution :
Dept. of Comput. Sci., Univ. of Washington, Seattle, WA
Abstract :
A long cherished goal in artificial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ability to learn by imitation has remained hard to achieve due to a number of factors, including the problem of learning in high-dimensional spaces and the problem of uncertainty. In this paper, we propose a new probabilistic approach to the problem of teaching a high degree-of-freedom robot (in particular, a humanoid robot) flexible and generalizable skills via imitation of a human teacher. The robot uses inference in a graphical model to learn sensor-based dynamics and infer a stable plan from a teacherpsilas demonstration of an action. The novel contribution of this work is a method for learning a nonparametric policy which generalizes a fixed action plan to operate over a continuous space of task variation. A notable feature of the approach is that it does not require any knowledge of the physics of the robot or the environment. By leveraging advances in probabilistic inference and Gaussian process regression, the method produces a nonparametric policy for sensor-based feedback control in continuous state and action spaces. We present experimental and simulation results using a Fujitsu HOAP-2 humanoid robot demonstrating imitation-based learning of a task involving lifting objects of different weights from a single human demonstration.
Keywords :
Gaussian processes; feedback; graph theory; humanoid robots; regression analysis; sensors; Fujitsu HOAP-2 humanoid robot; Gaussian process regression; artificial intelligence; degree-of-freedom robot; graphical model; high-dimensional spaces; human teacher; nonparametric policies; probabilistic approach; probabilistic inference; sensor-based feedback control; History; Humanoid robots; Humans; Planning; Probabilistic logic; Robot sensing systems; Robots;
Conference_Titel :
Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
Conference_Location :
Nice
Print_ISBN :
978-1-4244-2057-5
DOI :
10.1109/IROS.2008.4650778