Imitation and Reinforcement Learning

Author

Kober, Jens ; Peters, Jan

Author_Institution

Dept. of Empirical Inference & Machine Learning, Max Planck Inst. for Biol. Cybern., Tübingen, Germany

Volume

17

Issue

2

fYear

2010

fDate

6/1/2010 12:00:00 AM

Firstpage

55

Lastpage

62

Abstract

In this article, we present both novel learning algorithms and experiments using the dynamical system MPs. As such, we describe this MP representation in a way that it is straightforward to reproduce. We review an appropriate imitation learning method, i.e., locally weighted regression, and show how this method can be used both for initializing RL tasks as well as for modifying the start-up phase in a rhythmic task. We also show our current best-suited RL algorithm for this framework, i.e., PoWER. We present two complex motor tasks, i.e., ball-in-a-cup and ball paddling, learned on a real, physical Barrett WAM, using the methods presented in this article. Of particular interest is the ball-paddling application, as it requires a combination of both rhythmic and discrete dynamical systems MPs during the start-up phase to achieve a particular task.

Keywords

discrete systems; learning (artificial intelligence); regression analysis; robots; ball paddling; ball-in-a-cup; discrete dynamical system; imitation learning; industrial robots; motor primitive; reinforcement learning; weighted regression; whole arm manipulator; Anthropomorphism; Humans; Intelligent robots; Learning systems; Legged locomotion; Manufacturing; Planar motors; Robot programming; Service robots; Stability;

fLanguage

English

Journal_Title

Robotics Automation Magazine, IEEE

Publisher

ieee

Conference_Location

6/1/2010 12:00:00 AM

ISSN

1070-9932

Type

jour

DOI

10.1109/MRA.2010.936952

Filename

5480345