• DocumentCode
    2007013
  • Title

    Applying statistical generalization to determine search direction for reinforcement learning of movement primitives

  • Author

    Nemec, Bojan ; Forte, Domenic ; Vuga, Rok ; Tamosiunaite, Minijia ; Worgotter, Florentin ; Ude, Ales

  • Author_Institution
    Dept. of Automatics, Biocybernetics, & Robot., Jozef Stefan Inst., Ljubljana, Slovenia
  • fYear
    2012
  • fDate
    Nov. 29 2012-Dec. 1 2012
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    In this paper we present a new methodology for robot learning that combines ideas from statistical generalization and reinforcement learning. First we apply statistical generalization to compute an approximation for the optimal control policy as defined by training movements that solve the given task in a number of specific situations. This way we obtain a manifold of movements, which dimensionality is usually much smaller than the dimensionality of a full space of movement primitives. Next we refine the policy by means of reinforcement learning on the approximating manifold, which results in a learning problem constrained to the low dimensional manifold. We show that in some situations, learning on the low dimensional manifold can be implemented as an error learning algorithm. We apply golden section search to refine the control policy. Furthermore, we propose a reinforcement learning algorithm with an extended parameter set, which combines learning in constrained domain with learning in full space of parametric movement primitives, which makes it possible to explore actions outside of the initial approximating manifold. The proposed approach was tested for learning of pouring action both in simulation and on a real robot.
  • Keywords
    approximation theory; control engineering computing; intelligent robots; learning (artificial intelligence); mobile robots; optimal control; statistical analysis; approximating manifold; constrained domain; error learning algorithm; golden section search; low dimensional manifold; optimal control policy; parametric movement primitives; reinforcement learning; robot learning; search direction; statistical generalization; training movements; Aerospace electronics; Joints; Learning (artificial intelligence); Liquids; Manifolds; Robot sensing systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2012 12th IEEE-RAS International Conference on
  • Conference_Location
    Osaka
  • ISSN
    2164-0572
  • Type

    conf

  • DOI
    10.1109/HUMANOIDS.2012.6651500
  • Filename
    6651500