Title :
Robust Reinforcement Learning Control Using Integral Quadratic Constraints for Recurrent Neural Networks
Author :
Anderson, Charles W. ; Young, Peter Michael ; Buehner, Michael R. ; Knight, James N. ; Bush, Keith A. ; Hittle, Douglas C.
Author_Institution :
Colorado State Univ., Fort Collins
fDate :
7/1/2007 12:00:00 AM
Abstract :
The applicability of machine learning techniques for feedback control systems is limited by a lack of stability guarantees. Robust control theory offers a framework for analyzing the stability of feedback control loops, but for the integral quadratic constraint (IQC) framework used here, all components are required to be represented as linear, time-invariant systems plus uncertainties with, for IQCs used here, bounded gain. In this paper, the stability of a control loop including a recurrent neural network (NN) is analyzed by replacing the nonlinear and time-varying components of the NN with IQCs on their gain. As a result, a range of the NN´s weights is found within which stability is guaranteed. An algorithm is demonstrated for training the recurrent NN using reinforcement learning and guaranteeing stability while learning.
Keywords :
adaptive control; feedback; learning (artificial intelligence); linear systems; neurocontrollers; recurrent neural nets; robust control; time-varying systems; uncertain systems; PI control; adaptive control; convex optimization; feedback control system; integral quadratic constraints; linear time-invariant systems; machine learning; nonlinear component; proportional integral control; recurrent neural networks; reinforcement learning control; robust control; stability guarantee; time-varying component; uncertain system; Constraint theory; Feedback control; Integral equations; Machine learning; Neural networks; Recurrent neural networks; Robust control; Robust stability; Stability analysis; Uncertainty; Integral quadratic constraints (IQCs); recurrent neural networks (NNs); reinforcement learning; robust control; Algorithms; Artificial Intelligence; Biomimetics; Computer Simulation; Decision Support Techniques; Feedback; Least-Squares Analysis; Markov Chains; Models, Theoretical; Neural Networks (Computer); Reinforcement (Psychology);
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2007.899520