• DocumentCode
    2337375
  • Title

    An extended policy gradient algorithm for robot task learning

  • Author

    Cherubini, A. ; Giannone, F. ; Iocchi, L. ; Palamara, P.F.

  • Author_Institution
    Sapienza Univ. of Roma, Rome
  • fYear
    2007
  • fDate
    Oct. 29 2007-Nov. 2 2007
  • Firstpage
    4121
  • Lastpage
    4126
  • Abstract
    In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters, by extending the policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate.
  • Keywords
    gradient methods; learning (artificial intelligence); robots; extended policy gradient algorithm; learning techniques; optimal parameter sets; reinforcement learning algorithm; robot task learning; strategic decisional processes; Cognitive robotics; Design methodology; Genetic programming; Intelligent robots; Learning systems; Machine learning; Motion control; Robot kinematics; Robot sensing systems; Robot vision systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
  • Conference_Location
    San Diego, CA
  • Print_ISBN
    978-1-4244-0912-9
  • Electronic_ISBN
    978-1-4244-0912-9
  • Type

    conf

  • DOI
    10.1109/IROS.2007.4399219
  • Filename
    4399219