• DocumentCode
    2709985
  • Title

    Learning to generate subgoals for action sequences

  • Author

    Schmidhuber, Jürgen

  • Author_Institution
    Inst. fur Inf., Tech. Univ. Muenchen, Germany
  • fYear
    1991
  • fDate
    8-14 Jul 1991
  • Abstract
    Summary form only given. None of the existing learning algorithms for neural networks in time-varying environments addresses the problems of learning to `divide and conquer´. Algorithms based on pure gradient descent or on adaptive critic methods are not suitable for dynamic control problems with long time lags between actions and consequences, and that there is a need for algorithms that perform `compositional learning´. The author discusses a system which solves at least one problem associated with compositional learning. The system learns to generate subgoals. This is done with the help of `time-bridging´ adaptive models that predict the effects of the system´s subprograms. An experiment on obstacle avoidance in a two-dimensional environment illustrates the approach
  • Keywords
    adaptive systems; learning systems; neural nets; planning (artificial intelligence); time-varying systems; action sequences; adaptive critic methods; compositional learning; divide and conquer method; dynamic control problems; gradient descent; learning algorithms; neural networks; obstacle avoidance; subgoals; subprogram effects prediction; time bridging adaptive models; time lags; time-varying environments; Adaptive control; Neural networks; Permission; Predictive models; Programmable control;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1991., IJCNN-91-Seattle International Joint Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7803-0164-1
  • Type

    conf

  • DOI
    10.1109/IJCNN.1991.155375
  • Filename
    155375