• DocumentCode
    358907
  • Title

    Approximating networks, dynamic programming and stochastic approximation

  • Author

    Baglietto, M. ; Cervellera, C. ; Parisini, T. ; Sanguineti, M. ; Zoppoli, R.

  • Author_Institution
    Dept. of Commun., Comput. & Syst. Sci., Genova Univ., Italy
  • Volume
    5
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    3304
  • Abstract
    Approximate solution of a general N-stage stochastic optimal control problem is considered. It is known that discretizing uniformly the state components in applying dynamic programming may lead this procedure to incur the “curse of dimensionality”. Approximating networks, i.e., linear combinations of parametrized basis functions provided with density properties in some normed linear spaces, are then defined and used in two approximate methods (examples of such networks are neural networks with one hidden layer and linear output activation functions, radial basis functions, etc.). The first one consists of approximating the optimal cost-to-go functions in dynamic programming (such a technique is known in literature as “neuro-dynamic programming”); the second method reduces the original functional optimization problem to a nonlinear programming one that is solved by means of stochastic approximation. Approximating networks of suitable types benefit by the property that the number of parameters to be optimized and the number of samples to be used for approximating some classes of regular functions increase only linearly (or moderately) with the of dimensions of the arguments of the functions and the number of samples used to train the networks. We deem that such properties may enable us to solve N-stage stochastic optimal problems often avoiding the curse of dimensionality. The two methods are tested end compared in an example involving a 10-dimension state vector
  • Keywords
    approximation theory; dynamic programming; neural nets; optimal control; stochastic systems; approximating networks; general N-stage stochastic optimal control problem; linear output activation functions; neuro-dynamic programming; optimal cost-to-go functions; parametrized basis functions; radial basis functions; stochastic approximation; Dynamic programming; Ear; Equations; Functional programming; Neural networks; Optimal control; Reservoirs; State-space methods; Stochastic processes; Stochastic systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference, 2000. Proceedings of the 2000
  • Conference_Location
    Chicago, IL
  • ISSN
    0743-1619
  • Print_ISBN
    0-7803-5519-9
  • Type

    conf

  • DOI
    10.1109/ACC.2000.879176
  • Filename
    879176