An extended policy gradient algorithm for robot task learning

Author

Cherubini, A. ; Giannone, F. ; Iocchi, L. ; Palamara, P.F.

Author_Institution

Sapienza Univ. of Roma, Rome

fYear

2007

fDate

Oct. 29 2007-Nov. 2 2007

Firstpage

4121

Lastpage

4126

Abstract

In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters, by extending the policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate.

Keywords

gradient methods; learning (artificial intelligence); robots; extended policy gradient algorithm; learning techniques; optimal parameter sets; reinforcement learning algorithm; robot task learning; strategic decisional processes; Cognitive robotics; Design methodology; Genetic programming; Intelligent robots; Learning systems; Machine learning; Motion control; Robot kinematics; Robot sensing systems; Robot vision systems;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on

Conference_Location

San Diego, CA

Print_ISBN

978-1-4244-0912-9

Electronic_ISBN

978-1-4244-0912-9

Type

conf

DOI

10.1109/IROS.2007.4399219

Filename

4399219