DocumentCode
1047456
Title
Robust Reinforcement Learning Control Using Integral Quadratic Constraints for Recurrent Neural Networks
Author
Anderson, Charles W. ; Young, Peter Michael ; Buehner, Michael R. ; Knight, James N. ; Bush, Keith A. ; Hittle, Douglas C.
Author_Institution
Colorado State Univ., Fort Collins
Volume
18
Issue
4
fYear
2007
fDate
7/1/2007 12:00:00 AM
Firstpage
993
Lastpage
1002
Abstract
The applicability of machine learning techniques for feedback control systems is limited by a lack of stability guarantees. Robust control theory offers a framework for analyzing the stability of feedback control loops, but for the integral quadratic constraint (IQC) framework used here, all components are required to be represented as linear, time-invariant systems plus uncertainties with, for IQCs used here, bounded gain. In this paper, the stability of a control loop including a recurrent neural network (NN) is analyzed by replacing the nonlinear and time-varying components of the NN with IQCs on their gain. As a result, a range of the NN´s weights is found within which stability is guaranteed. An algorithm is demonstrated for training the recurrent NN using reinforcement learning and guaranteeing stability while learning.
Keywords
adaptive control; feedback; learning (artificial intelligence); linear systems; neurocontrollers; recurrent neural nets; robust control; time-varying systems; uncertain systems; PI control; adaptive control; convex optimization; feedback control system; integral quadratic constraints; linear time-invariant systems; machine learning; nonlinear component; proportional integral control; recurrent neural networks; reinforcement learning control; robust control; stability guarantee; time-varying component; uncertain system; Constraint theory; Feedback control; Integral equations; Machine learning; Neural networks; Recurrent neural networks; Robust control; Robust stability; Stability analysis; Uncertainty; Integral quadratic constraints (IQCs); recurrent neural networks (NNs); reinforcement learning; robust control; Algorithms; Artificial Intelligence; Biomimetics; Computer Simulation; Decision Support Techniques; Feedback; Least-Squares Analysis; Markov Chains; Models, Theoretical; Neural Networks (Computer); Reinforcement (Psychology);
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/TNN.2007.899520
Filename
4267699
Link To Document