مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

137748

Title :

Learning of closed-loop motion control

Author :

Farshidian, Farbod ; Neunert, Michael ; Buchli, Jonas

Author_Institution :

Agile & Dexterous Robot. Lab., ETH Zurich, Zurich, Switzerland

fYear :

2014

fDate :

14-18 Sept. 2014

Firstpage :

1441

Lastpage :

1446

Abstract :

Learning motion control as a unified process of designing the reference trajectory and the controller is one of the most challenging problems in robotics. The complexity of the problem prevents most of the existing optimization algorithms from giving satisfactory results. While model-based algorithms like iterative linear-quadratic-Gaussian (iLQG) can be used to design a suitable controller for the motion control, their performance is strongly limited by the model accuracy. An inaccurate model may lead to degraded performance of the controller on the physical system. Although using machine learning approaches to learn the motion control on real systems have been proven to be effective, their performance depends on good initialization. To address these issues, this paper introduces a two-step algorithm which combines the proven performance of a model-based controller with a model-free method for compensating for model inaccuracy. The first step optimizes the problem using iLQG. Then, in the second step this controller is used to initialize the policy for our PI²-01 reinforcement learning algorithm. This algorithm is a derivation of the PI² algorithm enabling more stable and faster convergence. The performance of this method is demonstrated both in simulation and experimental results.

Keywords :

PI control; closed loop systems; compensation; control system synthesis; iterative methods; learning (artificial intelligence); linear quadratic Gaussian control; motion control; PI²-01 reinforcement learning algorithm; closed-loop motion control learning; controller design; convergence; iLQG control; iterative linear-quadratic-Gaussian control; machine learning approaches; model inaccuracy compensation; model-based algorithms; model-based controller; model-free method; physical system; real systems; reference trajectory design; two-step algorithm; unified process; Adaptation models; Algorithm design and analysis; Cost function; Heuristic algorithms; Noise; Robots; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems (IROS 2014), 2014 IEEE/RSJ International Conference on

Conference_Location :

Chicago, IL

Type :

conf

DOI :

10.1109/IROS.2014.6942746

Filename :

6942746

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=137748