• DocumentCode
    250726
  • Title

    Multi-task policy search for robotics

  • Author

    Deisenroth, Marc Peter ; Englert, Peter ; Peters, Jochen ; Fox, D.

  • Author_Institution
    Dept. of Comput., Imperial Coll. London, London, UK
  • fYear
    2014
  • fDate
    May 31 2014-June 7 2014
  • Firstpage
    3876
  • Lastpage
    3881
  • Abstract
    Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in real-robot experiments are shown.
  • Keywords
    feedback; intelligent robots; learning (artificial intelligence); learning systems; nonlinear control systems; continuous task variations; imitation learning; individual policy training; knowledge transfer; multitask policy search; nonlinear feedback policy; reinforcement learning; robotics; Approximation methods; Artificial neural networks; Cameras; Grippers; Robot kinematics; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2014 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/ICRA.2014.6907421
  • Filename
    6907421