Robust Reinforcement Learning Control Using Integral Quadratic Constraints for Recurrent Neural Networks

Author

Anderson, Charles W. ; Young, Peter Michael ; Buehner, Michael R. ; Knight, James N. ; Bush, Keith A. ; Hittle, Douglas C.

Author_Institution

Colorado State Univ., Fort Collins

Volume

18

Issue

4

fYear

2007

fDate

7/1/2007 12:00:00 AM

Firstpage

993

Lastpage

1002

Abstract

The applicability of machine learning techniques for feedback control systems is limited by a lack of stability guarantees. Robust control theory offers a framework for analyzing the stability of feedback control loops, but for the integral quadratic constraint (IQC) framework used here, all components are required to be represented as linear, time-invariant systems plus uncertainties with, for IQCs used here, bounded gain. In this paper, the stability of a control loop including a recurrent neural network (NN) is analyzed by replacing the nonlinear and time-varying components of the NN with IQCs on their gain. As a result, a range of the NN´s weights is found within which stability is guaranteed. An algorithm is demonstrated for training the recurrent NN using reinforcement learning and guaranteeing stability while learning.

Keywords

adaptive control; feedback; learning (artificial intelligence); linear systems; neurocontrollers; recurrent neural nets; robust control; time-varying systems; uncertain systems; PI control; adaptive control; convex optimization; feedback control system; integral quadratic constraints; linear time-invariant systems; machine learning; nonlinear component; proportional integral control; recurrent neural networks; reinforcement learning control; robust control; stability guarantee; time-varying component; uncertain system; Constraint theory; Feedback control; Integral equations; Machine learning; Neural networks; Recurrent neural networks; Robust control; Robust stability; Stability analysis; Uncertainty; Integral quadratic constraints (IQCs); recurrent neural networks (NNs); reinforcement learning; robust control; Algorithms; Artificial Intelligence; Biomimetics; Computer Simulation; Decision Support Techniques; Feedback; Least-Squares Analysis; Markov Chains; Models, Theoretical; Neural Networks (Computer); Reinforcement (Psychology);

fLanguage

English

Journal_Title

Neural Networks, IEEE Transactions on

Publisher

ieee

ISSN

1045-9227

Type

jour

DOI

10.1109/TNN.2007.899520

Filename

4267699