• DocumentCode
    1798126
  • Title

    An approach to exploit non-optimized data for efficient control of unknown systems through neural and kernel models

  • Author

    Cervellera, Cristiano ; Gaggero, Mauro ; Maccio, Danilo ; Marcialis, Roberto

  • Author_Institution
    Inst. of Intell. Syst. for Autom., Genoa, Italy
  • fYear
    2014
  • fDate
    6-11 July 2014
  • Firstpage
    1856
  • Lastpage
    1863
  • Abstract
    In this paper, efficient real time control strategies are devised for systems with unknown state equation, based only on a set of data inherited from non-optimized, possibly inefficient, operation of the system, in the case in which experimenting online with the latter is impossible or costly. Neural networks and kernel smoothing models are employed as architectures for learning the system dynamics. The former require an offline training phase to learn the state equation, whereas the latter exploit the available data in a direct fashion, thus making the proposed approach directly applicable online and able to exploit new available data without the need of an offline training. Convergence properties of the proposed algorithm for generating the control strategies are provided under suitable hypotheses. Simulation results on classic benchmark systems are reported for performance evaluation, also through a comparison with the SARSA reinforcement learning algorithm.
  • Keywords
    convergence; learning (artificial intelligence); neural nets; smoothing methods; SARSA reinforcement learning algorithm; convergence properties; kernel models; kernel smoothing models; neural models; neural networks; nonoptimized data; offline training phase; performance evaluation; real time control strategies; system dynamics; system operation; unknown state equation; unknown systems control; Approximation methods; Data models; Equations; Kernel; Mathematical model; Trajectory; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), 2014 International Joint Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4799-6627-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.2014.6889794
  • Filename
    6889794