مرکز منطقه ای اطلاع رساني علوم و فناوري - Path integral-based stochastic optimal control for rigid body dynamics

DocumentCode :

493381

Title :

Path integral-based stochastic optimal control for rigid body dynamics

Author :

Theodorou, E.A. ; Buchli, J. ; Schaal, S.

Author_Institution :

Comput. Sci., Neurosci., & Biomed. Eng., Univ. of Southern California, Los Angeles, CA

fYear :

2009

fDate :

March 30 2009-April 2 2009

Firstpage :

219

Lastpage :

225

Abstract :

Recent advances on path integral stochastic optimal control [1],[2] provide new insights in the optimal control of nonlinear stochastic systems which are linear in the controls, with state independent and time invariant control transition matrix. Under these assumptions, the Hamilton-Jacobi-Bellman (HJB) equation is formulated and linearized with the use of the logarithmic transformation of the optimal value function. The resulting HJB is a linear second order partial differential equation which is solved by an approximation based on the Feynman-Kac formula [3]. In this work we review the theory of path integral control and derive the linearized HJB equation for systems with state dependent control transition matrix. In addition we derive the path integral formulation for the general class of systems with state dimensionality that is higher than the dimensionality of the controls. Furthermore, by means of a modified inverse dynamics controller, we apply path integral stochastic optimal control over the new control space. Simulations illustrate the theoretical results. Future developments and extensions are discussed.

Keywords :

matrix algebra; nonlinear control systems; optimal control; partial differential equations; stochastic systems; transforms; Hamilton-Jacobi-Bellman equation; logarithmic transformation; nonlinear stochastic systems; partial differential equation; path integral stochastic optimal control; rigid body dynamics; state independent control transition matrix; time invariant control transition matrix; Control systems; Cost function; Humanoid robots; Integral equations; Learning; Nonlinear control systems; Optimal control; Sampling methods; Stochastic processes; Stochastic systems;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL '09. IEEE Symposium on

Conference_Location :

Nashville, TN

Print_ISBN :

978-1-4244-2761-1

Type :

conf

DOI :

10.1109/ADPRL.2009.4927548

Filename :

4927548

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=493381