• DocumentCode
    295123
  • Title

    Model-free control of nonlinear stochastic systems in discrete time

  • Author

    Spall, James C. ; Cristion, John A.

  • Author_Institution
    Appl. Phys. Lab., Johns Hopkins Univ., Laurel, MD, USA
  • Volume
    3
  • fYear
    1995
  • fDate
    13-15 Dec 1995
  • Firstpage
    2199
  • Abstract
    Consider the problem of developing a controller for general (nonlinear and stochastic) discrete-time systems, where the equations governing the system are unknown. This paper presents an approach based on estimating a controller without building or assuming a model for the system. Such an approach has potential advantages in, e.g., accommodating systems with time-varying dynamics. The controller is constructed through use of a function approximator (FA) such as a neural network or polynomial (no FA is used for the unmodeled system equations). This involves the estimation of the unknown parameters within the FA. However, since no functional form is being assumed for the system equations, the gradient of the loss function for use in standard optimization algorithms is not available. Therefore, this paper considers the use of a stochastic approximation algorithm that is based on a simultaneous perturbation gradient approximation, which requires only system measurements (not a system model). Related to this, a convergence result for stochastic approximation algorithms with time-varying objective functions is established. It is shown that this algorithm can greatly enhance the efficiency over more standard stochastic approximation algorithms based on finite-difference gradient approximations
  • Keywords
    control system synthesis; discrete time systems; function approximation; nonlinear control systems; stochastic systems; convergence; function approximator; loss function gradient; model-free control; neural network; nonlinear stochastic discrete-time systems; polynomial; simultaneous perturbation gradient approximation; stochastic approximation algorithm; time-varying dynamics; time-varying objective functions; Approximation algorithms; Buildings; Control systems; Neural networks; Nonlinear control systems; Nonlinear dynamical systems; Nonlinear equations; Stochastic processes; Stochastic systems; Time varying systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
  • Conference_Location
    New Orleans, LA
  • ISSN
    0191-2216
  • Print_ISBN
    0-7803-2685-7
  • Type

    conf

  • DOI
    10.1109/CDC.1995.480529
  • Filename
    480529