DocumentCode
2709985
Title
Learning to generate subgoals for action sequences
Author
Schmidhuber, Jürgen
Author_Institution
Inst. fur Inf., Tech. Univ. Muenchen, Germany
fYear
1991
fDate
8-14 Jul 1991
Abstract
Summary form only given. None of the existing learning algorithms for neural networks in time-varying environments addresses the problems of learning to `divide and conquer´. Algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform `compositional learning´. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of `time-bridging´ adaptive models that predict the effects of the system´s subprograms. An experiment on obstacle avoidance in a two-dimensional environment illustrates the approach
Keywords
adaptive systems; learning systems; neural nets; planning (artificial intelligence); time-varying systems; action sequences; adaptive critic methods; compositional learning; divide and conquer method; dynamic control problems; gradient descent; learning algorithms; neural networks; obstacle avoidance; subgoals; subprogram effects prediction; time bridging adaptive models; time lags; time-varying environments; Adaptive control; Neural networks; Permission; Predictive models; Programmable control;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on
Conference_Location
Seattle, WA
Print_ISBN
0-7803-0164-1
Type
conf
DOI
10.1109/IJCNN.1991.155375
Filename
155375
Link To Document