Efficient robust policy optimization

Author

Atkeson, Christopher G.

Author_Institution

Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear

2012

fDate

27-29 June 2012

Firstpage

5220

Lastpage

5227

Abstract

We provide efficient algorithms to calculate first and second order gradients of the cost of a control law with respect to its parameters, to speed up policy optimization. We achieve robustness by simultaneously designing one control law for multiple models with potentially different model structures, which represent model uncertainty and unmodeled dynamics. Providing explicit examples of possible unmodeled dynamics during the control design process is easier for the designer and is more effective than providing simulated perturbations to increase robustness, as is currently done in machine learning. Our approach supports the design of deterministic nonlinear and time varying controllers for both deterministic and stochastic nonlinear and time varying systems, including policies with internal state such as observers or other state estimators. We highlight the benefit of control laws made up of collections of simple policies where only one component policy is active at a time. Controller optimization and learning is particularly fast and effective in this situation because derivatives are decoupled.

Keywords

control system synthesis; learning (artificial intelligence); nonlinear control systems; optimisation; state estimation; stochastic systems; control design process; control law design; controller optimization; deterministic nonlinear controllers; deterministic systems; efficient robust policy optimization; first order gradients; internal state; machine learning; model structures; model uncertainty; multiple models; observers; second order gradients; simulated perturbations; state estimators; stochastic nonlinear systems; time varying controllers; time varying systems; unmodeled dynamics; Computational modeling; Cost function; Equations; Mathematical model; Trajectory; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

American Control Conference (ACC), 2012

Conference_Location

Montreal, QC

ISSN

0743-1619

Print_ISBN

978-1-4577-1095-7

Electronic_ISBN

0743-1619

Type

conf

DOI

10.1109/ACC.2012.6315619

Filename

6315619