• DocumentCode
    2630251
  • Title

    Curious model-building control systems

  • Author

    Schmidhuber, Jürgen

  • Author_Institution
    Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
  • fYear
    1991
  • fDate
    18-21 Nov 1991
  • Firstpage
    1458
  • Abstract
    A novel curious model-building control system is described which actively tries to provoke situations for which it learned to expect to learn something about the environment. Such a system has been implemented as a four-network system based on Watkins´ Q-learning algorithm which can be used to maximize the expectation of the temporal derivative of the adaptive assumed reliability of future predictions. An experiment with an artificial nondeterministic environment demonstrates that the system can be superior to previous model-building control systems, which do not address the problem of modeling the reliability of the world model´s predictions in uncertain environments and use ad-hoc methods (like random search) to train the world model
  • Keywords
    adaptive control; adaptive systems; learning systems; neural nets; Watkins´ Q-learning algorithm; adaptive assumed reliability; adaptive control; adaptive systems; artificial nondeterministic environment; four-network system; future predictions; learning systems; model-building control systems; temporal derivative; Adaptive control; Computer architecture; Computer networks; Computer science; Control system synthesis; Error correction; Learning; Manipulator dynamics; Predictive models; Programmable control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1991. 1991 IEEE International Joint Conference on
  • Print_ISBN
    0-7803-0227-3
  • Type

    conf

  • DOI
    10.1109/IJCNN.1991.170605
  • Filename
    170605