مرکز منطقه ای اطلاع رساني علوم و فناوري - Approximate real-time optimal control based on sparse Gaussian process models

DocumentCode :

1799300

Title :

Approximate real-time optimal control based on sparse Gaussian process models

Author :

Boedecker, Joschka ; Springenberg, Jost Tobias ; Wulfing, Jan ; Riedmiller, Martin

Author_Institution :

Dept. of Comput. Sci., Univ. of Freiburg, Freiburg, Germany

fYear :

2014

fDate :

9-12 Dec. 2014

Firstpage :

Lastpage :

Abstract :

In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model of the system dynamics - based on Gaussian Process Regression (GPR) - and performs receding horizon control using an adapted iterative LQR formulation. This results in an extremely data-efficient learning algorithm that can operate under real-time constraints. When combined with an exploration strategy based on GPR variance, our algorithm successfully learns to control two benchmark problems in simulation (two-link manipulator, cart-pole) as well as to swing-up and balance a real cart-pole system. For all considered problems learning from scratch, that is without prior knowledge provided by an expert, succeeds in less than 10 episodes of interaction with the system.

Keywords :

Gaussian processes; learning systems; linear quadratic control; manipulators; nonlinear dynamical systems; regression analysis; GPR variance; Gaussian process regression; approximate real-time optimal control; cart-pole system; data-efficient learning algorithm; iterative LQR formulation; nonlinear systems; receding horizon control; sparse Gaussian process models; system dynamics nonparametric model; two-link manipulator; Approximation algorithms; Approximation methods; Computational modeling; Optimal control; Optimization; Predictive models; Trajectory;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014 IEEE Symposium on

Conference_Location :

Orlando, FL

Type :

conf

DOI :

10.1109/ADPRL.2014.7010608

Filename :

7010608

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1799300