Title :
Curious model-building control systems
Author :
Schmidhuber, Jürgen
Author_Institution :
Dept. of Comput. Sci., Colorado Univ., Boulder, CO, USA
Abstract :
A novel curious model-building control system is described which actively tries to provoke situations for which it learned to expect to learn something about the environment. Such a system has been implemented as a four-network system based on Watkins´ Q-learning algorithm which can be used to maximize the expectation of the temporal derivative of the adaptive assumed reliability of future predictions. An experiment with an artificial nondeterministic environment demonstrates that the system can be superior to previous model-building control systems, which do not address the problem of modeling the reliability of the world model´s predictions in uncertain environments and use ad-hoc methods (like random search) to train the world model
Keywords :
adaptive control; adaptive systems; learning systems; neural nets; Watkins´ Q-learning algorithm; adaptive assumed reliability; adaptive control; adaptive systems; artificial nondeterministic environment; four-network system; future predictions; learning systems; model-building control systems; temporal derivative; Adaptive control; Computer architecture; Computer networks; Computer science; Control system synthesis; Error correction; Learning; Manipulator dynamics; Predictive models; Programmable control;
Conference_Titel :
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN :
0-7803-0227-3
DOI :
10.1109/IJCNN.1991.170605