DocumentCode
816392
Title
Training Recurrent Neurocontrollers for Robustness With Derivative-Free Kalman Filter
Author
Prokhorov, D.V.
Author_Institution
Ford Res. & Adv. Eng., Dearborn, MI
Volume
17
Issue
6
fYear
2006
Firstpage
1606
Lastpage
1616
Abstract
We are interested in training neurocontrollers for robustness on discrete-time models of physical systems. Our neurocontrollers are implemented as recurrent neural networks (RNNs). A model of the system to be controlled is known up to parameter and/or signal uncertainties. Parameter values are drawn from a known distribution. For each instance of the model with specified parameters, a recurrent neurocontroller is trained by evaluating sensitivities of the model outputs to perturbations of the neurocontroller weights and incrementally updating the weights. Our training process strives to minimize a quadratic cost function averaged over many different models. In the end, the process yields a robust recurrent neurocontroller, which is ready for deployment with fixed weights. We employ a derivative-free Kalman filter algorithm proposed by Norgaard and extended by Feldkamp (2001) and Feldkamp (2002) to neural network training. Our training algorithm combines the effectiveness of a second-order training method with universal applicability to both differentiable and nondifferentiable systems. Our approach is that of model reference control, and it significantly extends the capabilities proposed by Prokhorov (2001). We illustrate it with two examples.
Keywords
Kalman filters; discrete time systems; neurocontrollers; recurrent neural nets; robust control; derivative-free Kalman filter algorithm; discrete-time models; model reference control; training process; training recurrent neurocontrollers; Adaptive control; Control system synthesis; Mathematical model; Neural networks; Neurocontrollers; Power system modeling; Programmable control; Recurrent neural networks; Robustness; Uncertainty; Derivative-free Kalman filter; neurocontroller; recurrent neural network (RNN); training for robustness; Algorithms; Computer Simulation; Computer Systems; Decision Support Techniques; Feedback; Models, Theoretical; Neural Networks (Computer)
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
IEEE
ISSN
1045-9227
Type
jour
DOI
10.1109/TNN.2006.880580
Filename
4012041