• DocumentCode
    3486774
  • Title

    Efficient robust policy optimization

  • Author

    Atkeson, Christopher G.

  • Author_Institution
    Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2012
  • fDate
    27-29 June 2012
  • Firstpage
    5220
  • Lastpage
    5227
  • Abstract
    We provide efficient algorithms to calculate first and second order gradients of the cost of a control law with respect to its parameters, to speed up policy optimization. We achieve robustness by simultaneously designing one control law for multiple models with potentially different model structures, which represent model uncertainty and unmodeled dynamics. Providing explicit examples of possible unmodeled dynamics during the control design process is easier for the designer and is more effective than providing simulated perturbations to increase robustness, as is currently done in machine learning. Our approach supports the design of deterministic nonlinear and time varying controllers for both deterministic and stochastic nonlinear and time varying systems, including policies with internal state such as observers or other state estimators. We highlight the benefit of control laws made up of collections of simple policies where only one component policy is active at a time. Controller optimization and learning is particularly fast and effective in this situation because derivatives are decoupled.
  • Keywords
    control system synthesis; learning (artificial intelligence); nonlinear control systems; optimisation; state estimation; stochastic systems; control design process; control law design; controller optimization; deterministic nonlinear controllers; deterministic systems; efficient robust policy optimization; first order gradients; internal state; machine learning; model structures; model uncertainty; multiple models; observers; second order gradients; simulated perturbations; state estimators; stochastic nonlinear systems; time varying controllers; time varying systems; unmodeled dynamics; Computational modeling; Cost function; Equations; Mathematical model; Trajectory; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference (ACC), 2012
  • Conference_Location
    Montreal, QC
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4577-1095-7
  • Electronic_ISBN
    0743-1619
  • Type

    conf

  • DOI
    10.1109/ACC.2012.6315619
  • Filename
    6315619