DocumentCode
3486774
Title
Efficient robust policy optimization
Author
Atkeson, Christopher G.
Author_Institution
Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2012
fDate
27-29 June 2012
Firstpage
5220
Lastpage
5227
Abstract
We provide efficient algorithms to calculate first and second order gradients of the cost of a control law with respect to its parameters, to speed up policy optimization. We achieve robustness by simultaneously designing one control law for multiple models with potentially different model structures, which represent model uncertainty and unmodeled dynamics. Providing explicit examples of possible unmodeled dynamics during the control design process is easier for the designer and is more effective than providing simulated perturbations to increase robustness, as is currently done in machine learning. Our approach supports the design of deterministic nonlinear and time varying controllers for both deterministic and stochastic nonlinear and time varying systems, including policies with internal state such as observers or other state estimators. We highlight the benefit of control laws made up of collections of simple policies where only one component policy is active at a time. Controller optimization and learning is particularly fast and effective in this situation because derivatives are decoupled.
Keywords
control system synthesis; learning (artificial intelligence); nonlinear control systems; optimisation; state estimation; stochastic systems; control design process; control law design; controller optimization; deterministic nonlinear controllers; deterministic systems; efficient robust policy optimization; first order gradients; internal state; machine learning; model structures; model uncertainty; multiple models; observers; second order gradients; simulated perturbations; state estimators; stochastic nonlinear systems; time varying controllers; time varying systems; unmodeled dynamics; Computational modeling; Cost function; Equations; Mathematical model; Trajectory; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
American Control Conference (ACC), 2012
Conference_Location
Montreal, QC
ISSN
0743-1619
Print_ISBN
978-1-4577-1095-7
Electronic_ISBN
0743-1619
Type
conf
DOI
10.1109/ACC.2012.6315619
Filename
6315619
Link To Document