Applying statistical generalization to determine search direction for reinforcement learning of movement primitives

Author

Nemec, Bojan ; Forte, Domenic ; Vuga, Rok ; Tamosiunaite, Minijia ; Worgotter, Florentin ; Ude, Ales

Author_Institution

Dept. of Automatics, Biocybernetics, & Robot., Jozef Stefan Inst., Ljubljana, Slovenia

fYear

2012

fDate

Nov. 29 2012-Dec. 1 2012

Firstpage

65

Lastpage

70

Abstract

In this paper we present a new methodology for robot learning that combines ideas from statistical generalization and reinforcement learning. First we apply statistical generalization to compute an approximation for the optimal control policy as defined by training movements that solve the given task in a number of specific situations. This way we obtain a manifold of movements, which dimensionality is usually much smaller than the dimensionality of a full space of movement primitives. Next we refine the policy by means of reinforcement learning on the approximating manifold, which results in a learning problem constrained to the low dimensional manifold. We show that in some situations, learning on the low dimensional manifold can be implemented as an error learning algorithm. We apply golden section search to refine the control policy. Furthermore, we propose a reinforcement learning algorithm with an extended parameter set, which combines learning in constrained domain with learning in full space of parametric movement primitives, which makes it possible to explore actions outside of the initial approximating manifold. The proposed approach was tested for learning of pouring action both in simulation and on a real robot.

Keywords

approximation theory; control engineering computing; intelligent robots; learning (artificial intelligence); mobile robots; optimal control; statistical analysis; approximating manifold; constrained domain; error learning algorithm; golden section search; low dimensional manifold; optimal control policy; parametric movement primitives; reinforcement learning; robot learning; search direction; statistical generalization; training movements; Aerospace electronics; Joints; Learning (artificial intelligence); Liquids; Manifolds; Robot sensing systems;

fLanguage

English

Publisher

ieee

Conference_Titel

Humanoid Robots (Humanoids), 2012 12th IEEE-RAS International Conference on

Conference_Location

Osaka

ISSN

2164-0572

Type

conf

DOI

10.1109/HUMANOIDS.2012.6651500

Filename

6651500