An approach to exploit non-optimized data for efficient control of unknown systems through neural and kernel models

Author

Cervellera, Cristiano ; Gaggero, Mauro ; Maccio, Danilo ; Marcialis, Roberto

Author_Institution

Inst. of Intell. Syst. for Autom., Genoa, Italy

fYear

2014

fDate

6-11 July 2014

Firstpage

1856

Lastpage

1863

Abstract

In this paper, efficient real time control strategies are devised for systems with unknown state equation, based only on a set of data inherited from non-optimized, possibly inefficient, operation of the system, in the case in which experimenting online with the latter is impossible or costly. Neural networks and kernel smoothing models are employed as architectures for learning the system dynamics. The former require an offline training phase to learn the state equation, whereas the latter exploit the available data in a direct fashion, thus making the proposed approach directly applicable online and able to exploit new available data without the need of an offline training. Convergence properties of the proposed algorithm for generating the control strategies are provided under suitable hypotheses. Simulation results on classic benchmark systems are reported for performance evaluation, also through a comparison with the SARSA reinforcement learning algorithm.

Keywords

convergence; learning (artificial intelligence); neural nets; smoothing methods; SARSA reinforcement learning algorithm; convergence properties; kernel models; kernel smoothing models; neural models; neural networks; nonoptimized data; offline training phase; performance evaluation; real time control strategies; system dynamics; system operation; unknown state equation; unknown systems control; Approximation methods; Data models; Equations; Kernel; Mathematical model; Trajectory; Vectors;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks (IJCNN), 2014 International Joint Conference on

Conference_Location

Beijing

Print_ISBN

978-1-4799-6627-1

Type

conf

DOI

10.1109/IJCNN.2014.6889794

Filename

6889794