Title :
Iterated LQR smoothing for locally-optimal feedback control of systems with non-linear dynamics and non-quadratic cost
Author :
van den Berg, Jan
Author_Institution :
Sch. of Comput., Univ. of Utah, Salt Lake City, UT, USA
Abstract :
This paper introduces the novel concept of LQR smoothing, which analogous to Kalman smoothing consists of both a backward pass and a forward pass. In the backward pass the cost-to-go function is computed using the standard LQR Riccati equation that runs backward in time, and in the forward pass the cost-to-come function is computed using a Riccati equation that runs forward in time. The sum of the cost-to-go and the cost-to-come function gives the total-cost function, and we will show that the states for which the total-cost function is minimal constitute the minimum-cost trajectory for the linear-quadratic optimal control problem. This insight is used to construct a fast-converging iterative procedure to compute a locally-optimal feedback control policy for systems with non-linear dynamics and non-quadratic cost, where in each iteration the current minimal-total-cost states provide natural points about which the dynamics can be linearized and the cost quadratized. We demonstrate the potential of our approach on two illustrative non-linear control problems involving physical differential-drive robots and simulated quadrotor helicopters in environments with obstacles, and show that our approach converges in only about a third of the number of iterations required by existing approaches such as Iterative LQR.
Keywords :
Kalman filters; Riccati equations; autonomous aerial vehicles; feedback; helicopters; iterative methods; linear quadratic control; nonlinear control systems; smoothing methods; Kalman smoothing; backward pass; cost-to-come function; cost-to-go function; forward pass; iterated LQR smoothing; iterative procedure; linear-quadratic optimal control problem; locally-optimal feedback control; minimum-cost trajectory; nonlinear control problems; nonlinear dynamics; nonquadratic cost; physical differential-drive robots; quadrotor helicopters; standard LQR Riccati equation; total-cost function; Convergence; Cost function; Feedback control; Kalman filters; Optimal control; Smoothing methods; Trajectory; Optimal control; Predictive control for linear systems; Predictive control for nonlinear systems;
Conference_Titel :
American Control Conference (ACC), 2014
Conference_Location :
Portland, OR
Print_ISBN :
978-1-4799-3272-6
DOI :
10.1109/ACC.2014.6859404