• DocumentCode
    1265189
  • Title

    Feedback Control by Online Learning an Inverse Model

  • Author

    Waegeman, T. ; Wyffels, Francis ; Schrauwen, Benjamin

  • Author_Institution
    Dept. of Electron. & Inf. Syst., Ghent Univ., Ghent, Belgium
  • Volume
    23
  • Issue
    10
  • fYear
    2012
  • Firstpage
    1637
  • Lastpage
    1648
  • Abstract
    A model, predictor, or error estimator is often used by a feedback controller to control a plant. Creating such a model is difficult when the plant exhibits nonlinear behavior. In this paper, a novel online learning control framework is proposed that does not require explicit knowledge about the plant. This framework uses two learning modules, one for creating an inverse model, and the other for actually controlling the plant. Except for their inputs, they are identical. The inverse model learns by the exploration performed by the not yet fully trained controller, while the actual controller is based on the currently learned model. The proposed framework allows fast online learning of an accurate controller. The controller can be applied on a broad range of tasks with different dynamic characteristics. We validate this claim by applying our control framework on several control tasks: 1) the heating tank problem (slow nonlinear dynamics); 2) flight pitch control (slow linear dynamics); and 3) the balancing problem of a double inverted pendulum (fast linear and nonlinear dynamics). The results of these experiments show that fast learning and accurate control can be achieved. Furthermore, a comparison is made with some classical control approaches, and observations concerning convergence and stability are made.
  • Keywords
    control engineering computing; feedback; learning (artificial intelligence); nonlinear control systems; stability; balancing problem; double inverted pendulum; dynamic characteristic; feedback control; flight pitch control; heating tank problem; inverse model; nonlinear behavior; online learning control; slow linear dynamics; slow nonlinear dynamics; Artificial neural networks; Asymptotic stability; Neurons; Reservoirs; Stability analysis; Training; Water heating; Adaptive control; feedback control; heating tank; inverted pendulum; neural network; pitch control; reservoir computing;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2012.2208655
  • Filename
    6269107