• DocumentCode
    2017373
  • Title

    Search space reduction for strategy learning in sequential decision processes

  • Author

    Schoknecht, Ralf ; Spott, Martin ; Liekweg, Florian ; Riedmiller, Martin

  • Author_Institution
    Fakultät für Inf., Karlsruhe Univ., Germany
  • Volume
    1
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    148
  • Abstract
    Sequential decision making in large domains is computationally expensive. With the classical dynamic programming approach, a growing problem size soon leads to intractability because of time and memory constraints. This situation can be significantly remedied by using more advanced reinforcement learning techniques in combination with generalizing function approximators. However, this may lead to unstable learning behaviour, as the strict convergence results are no longer valid. The paper presents an approach to stabilize learning by gradually reducing the search space for the optimal decision policy. This is done by iteratively adapting the action set according to the progress of learning. Experiments are described within the FYNESSE control architecture, which is a framework for autonomously learning adaptive control strategies.
  • Keywords
    adaptive control; decision theory; dynamic programming; learning (artificial intelligence); search problems; FYNESSE control architecture; action set; adaptive control strategies; advanced reinforcement learning techniques; autonomous learning; classical dynamic programming approach; computational expense; generalizing function approximators; intractability; large domains; memory constraints; optimal decision policy; problem size; search space reduction; sequential decision making; sequential decision processes; strategy learning; strict convergence results; unstable learning behaviour; Adaptive control; Analytical models; Control systems; Convergence; Decision making; Delay; Learning; Nonlinear control systems; Operations research; State estimation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Neural Information Processing, 1999. Proceedings. ICONIP '99. 6th International Conference on
  • Conference_Location
    Perth, WA
  • Print_ISBN
    0-7803-5871-6
  • Type

    conf

  • DOI
    10.1109/ICONIP.1999.843977
  • Filename
    843977