Title :
A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
Author :
Todorov, Emanuel ; Li, Weiwei
Author_Institution :
Dept. of Cognitive Sci., California Univ., San Diego, CA, USA
Abstract :
We present an iterative linear-quadratic-Gaussian method for locally-optimal feedback control of nonlinear stochastic systems subject to control constraints. Previously, similar methods have been restricted to deterministic unconstrained problems with quadratic costs. The new method constructs an affine feedback control law, obtained by minimizing a novel quadratic approximation to the optimal cost-to-go function. Global convergence is guaranteed through a Levenberg-Marquardt method; convergence in the vicinity of a local minimum is quadratic. Performance is illustrated on a limited-torque inverted pendulum problem, as well as a complex biomechanical control problem involving a stochastic model of the human arm, with 10 state dimensions and 6 muscle actuators. A Matlab implementation of the new algorithm is availabe at www.cogsci.ucsd.edu/∼todorov.
Keywords :
actuators; biomechanics; constraint theory; convergence; feedback; iterative methods; linear quadratic Gaussian control; muscle; nonlinear systems; optimal control; pendulums; quadratic programming; stochastic systems; Levenberg-Marquardt method; Matlab; biomechanical control problem; constrained nonlinear stochastic systems; control constraints; deterministic unconstrained problems; generalized iterative LQG method; global convergence; human arm; iterative linear-quadratic-Gaussian method; limited-torque inverted pendulum; locally-optimal feedback control; muscle actuators; optimal cost-to-go function; quadratic approximation; quadratic cost; state dimensions; stochastic model; Control systems; Convergence; Costs; Feedback control; Iterative methods; Linear feedback control systems; Mathematical model; Nonlinear control systems; Stochastic processes; Stochastic systems;
Conference_Titel :
American Control Conference, 2005. Proceedings of the 2005
Print_ISBN :
0-7803-9098-9
Electronic_ISBN :
0743-1619
DOI :
10.1109/ACC.2005.1469949