DocumentCode
2007013
Title
Applying statistical generalization to determine search direction for reinforcement learning of movement primitives
Author
Nemec, Bojan ; Forte, Domenic ; Vuga, Rok ; Tamosiunaite, Minijia ; Worgotter, Florentin ; Ude, Ales
Author_Institution
Dept. of Automatics, Biocybernetics, & Robot., Jozef Stefan Inst., Ljubljana, Slovenia
fYear
2012
fDate
Nov. 29 2012-Dec. 1 2012
Firstpage
65
Lastpage
70
Abstract
In this paper we present a new methodology for robot learning that combines ideas from statistical generalization and reinforcement learning. First we apply statistical generalization to compute an approximation for the optimal control policy as defined by training movements that solve the given task in a number of specific situations. This way we obtain a manifold of movements, which dimensionality is usually much smaller than the dimensionality of a full space of movement primitives. Next we refine the policy by means of reinforcement learning on the approximating manifold, which results in a learning problem constrained to the low dimensional manifold. We show that in some situations, learning on the low dimensional manifold can be implemented as an error learning algorithm. We apply golden section search to refine the control policy. Furthermore, we propose a reinforcement learning algorithm with an extended parameter set, which combines learning in constrained domain with learning in full space of parametric movement primitives, which makes it possible to explore actions outside of the initial approximating manifold. The proposed approach was tested for learning of pouring action both in simulation and on a real robot.
Keywords
approximation theory; control engineering computing; intelligent robots; learning (artificial intelligence); mobile robots; optimal control; statistical analysis; approximating manifold; constrained domain; error learning algorithm; golden section search; low dimensional manifold; optimal control policy; parametric movement primitives; reinforcement learning; robot learning; search direction; statistical generalization; training movements; Aerospace electronics; Joints; Learning (artificial intelligence); Liquids; Manifolds; Robot sensing systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Humanoid Robots (Humanoids), 2012 12th IEEE-RAS International Conference on
Conference_Location
Osaka
ISSN
2164-0572
Type
conf
DOI
10.1109/HUMANOIDS.2012.6651500
Filename
6651500
Link To Document