Training Recurrent Neurocontrollers for Robustness With Derivative-Free Kalman Filter

Author

Prokhorov, D.V.

Author_Institution

Ford Res. & Adv. Eng., Dearborn, MI

Volume

17

Issue

6

fYear

2006

Firstpage

1606

Lastpage

1616

Abstract

We are interested in training neurocontrollers for robustness on discrete-time models of physical systems. Our neurocontrollers are implemented as recurrent neural networks (RNNs). A model of the system to be controlled is known up to parameter and/or signal uncertainties. Parameter values are drawn from a known distribution. For each instance of the model with specified parameters, a recurrent neurocontroller is trained by evaluating sensitivities of the model outputs to perturbations of the neurocontroller weights and incrementally updating the weights. Our training process strives to minimize a quadratic cost function averaged over many different models. In the end, the process yields a robust recurrent neurocontroller, which is ready for deployment with fixed weights. We employ a derivative-free Kalman filter algorithm proposed by Norgaard and extended by Feldkamp (2001) and Feldkamp (2002) to neural network training. Our training algorithm combines the effectiveness of a second-order training method with universal applicability to both differentiable and nondifferentiable systems. Our approach is that of model reference control, and it significantly extends the capabilities proposed by Prokhorov (2001). We illustrate it with two examples.
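
The training loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm: the scalar plant, the four-weight recurrent controller, the reference model, and every name and constant (`rollout`, `reference`, `fd_jacobian`, `train`, the noise matrices `R` and `Q`) are hypothetical. Sensitivities are estimated by central-difference perturbation of the weights, and the weight update is a plain Kalman-style step built on those sensitivities, used here as a simplified stand-in for the derivative-free Kalman filter named above.

```python
import numpy as np

def rollout(w, a, steps=20):
    """Closed-loop outputs of a hypothetical plant x' = a*x + u
    driven by a tiny recurrent controller with weights w."""
    x, h, ys = 0.0, 0.0, []
    for _ in range(steps):
        h = np.tanh(w[0] * h + w[1] * x + w[2] * 1.0)  # reference input r = 1
        u = w[3] * h                                   # control signal
        x = a * x + u                                  # plant step
        ys.append(x)
    return np.array(ys)

def reference(steps=20):
    """Outputs of an assumed reference model x' = 0.5*x + 0.5*r with r = 1."""
    x, ys = 0.0, []
    for _ in range(steps):
        x = 0.5 * x + 0.5
        ys.append(x)
    return np.array(ys)

def fd_jacobian(f, w, eps=1e-4):
    """Central-difference sensitivities of f(w) to each weight.
    Only evaluations of f are needed, never its derivatives."""
    cols = []
    for i in range(w.size):
        dw = np.zeros_like(w)
        dw[i] = eps
        cols.append((f(w + dw) - f(w - dw)) / (2 * eps))
    return np.stack(cols, axis=1)

def train(n_models=50, steps=20, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.normal(scale=0.1, size=4)   # controller weights
    P = np.eye(4)                       # weight error covariance (assumed init)
    R = 10.0 * np.eye(steps)            # measurement noise (assumed)
    Q = 1e-4 * np.eye(4)                # process noise (assumed)
    target = reference(steps)
    for _ in range(n_models):
        a = rng.uniform(0.3, 0.7)       # draw one plant instance
        f = lambda v: rollout(v, a, steps)
        H = fd_jacobian(f, w)           # sensitivities for this instance
        e = target - f(w)               # tracking error vs. reference model
        K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)
        w = w + K @ e                   # incremental weight update
        P = P - K @ H @ P + Q
    return w                            # fixed weights for deployment
```

Calling `train()` returns one weight vector that has been updated against many sampled plant instances, which mirrors the idea of training a single controller for robustness rather than tuning it to one model.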

Keywords

Kalman filters; discrete time systems; neurocontrollers; recurrent neural nets; robust control; derivative-free Kalman filter algorithm; discrete-time models; model reference control; training process; training recurrent neurocontrollers; Adaptive control; Control system synthesis; Mathematical model; Neural networks; Neurocontrollers; Power system modeling; Programmable control; Recurrent neural networks; Robustness; Uncertainty; Derivative-free Kalman filter; neurocontroller; recurrent neural network (RNN); training for robustness; Algorithms; Computer Simulation; Computer Systems; Decision Support Techniques; Feedback; Models, Theoretical; Neural Networks (Computer)

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

IEEE

ISSN

1045-9227

Type

jour

DOI

10.1109/TNN.2006.880580

Filename

4012041