Learning to generate subgoals for action sequences

Author

Schmidhuber, Jürgen

Author_Institution

Inst. fur Inf., Tech. Univ. Muenchen, Germany

fYear

1991

fDate

8-14 Jul 1991

Abstract

Summary form only given. None of the existing learning algorithms for neural networks in time-varying environments addresses the problems of learning to `divide and conquer´. Algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform `compositional learning´. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of `time-bridging´ adaptive models that predict the effects of the system´s subprograms. An experiment on obstacle avoidance in a two-dimensional environment illustrates the approach

Keywords

adaptive systems; learning systems; neural nets; planning (artificial intelligence); time-varying systems; action sequences; adaptive critic methods; compositional learning; divide and conquer method; dynamic control problems; gradient descent; learning algorithms; neural networks; obstacle avoidance; subgoals; subprogram effects prediction; time bridging adaptive models; time lags; time-varying environments; Adaptive control; Neural networks; Permission; Predictive models; Programmable control;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on

Conference_Location

Seattle, WA

Print_ISBN

0-7803-0164-1

Type

conf

DOI

10.1109/IJCNN.1991.155375

Filename

155375