DocumentCode :
493381
Title :
Path integral-based stochastic optimal control for rigid body dynamics
Author :
Theodorou, E.A. ; Buchli, J. ; Schaal, S.
Author_Institution :
Comput. Sci., Neurosci., & Biomed. Eng., Univ. of Southern California, Los Angeles, CA
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
219
Lastpage :
225
Abstract :
Recent advances on path integral stochastic optimal control [1],[2] provide new insights in the optimal control of nonlinear stochastic systems which are linear in the controls, with state independent and time invariant control transition matrix. Under these assumptions, the Hamilton-Jacobi-Bellman (HJB) equation is formulated and linearized with the use of the logarithmic transformation of the optimal value function. The resulting HJB is a linear second order partial differential equation which is solved by an approximation based on the Feynman-Kac formula [3]. In this work we review the theory of path integral control and derive the linearized HJB equation for systems with state dependent control transition matrix. In addition we derive the path integral formulation for the general class of systems with state dimensionality that is higher than the dimensionality of the controls. Furthermore, by means of a modified inverse dynamics controller, we apply path integral stochastic optimal control over the new control space. Simulations illustrate the theoretical results. Future developments and extensions are discussed.
Keywords :
matrix algebra; nonlinear control systems; optimal control; partial differential equations; stochastic systems; transforms; Hamilton-Jacobi-Bellman equation; logarithmic transformation; nonlinear stochastic systems; partial differential equation; path integral stochastic optimal control; rigid body dynamics; state independent control transition matrix; time invariant control transition matrix; Control systems; Cost function; Humanoid robots; Integral equations; Learning; Nonlinear control systems; Optimal control; Sampling methods; Stochastic processes; Stochastic systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2761-1
Type :
conf
DOI :
10.1109/ADPRL.2009.4927548
Filename :
4927548
Link To Document :
بازگشت