• DocumentCode
    399709
  • Title

    A robot that reinforcement-learns to identify and memorize important previous observations

  • Author

    Bakker, Bram ; Zhumatiy, Viktor ; Gruener, G. ; Schmidhuber, Jiirgen

  • Author_Institution
    IDSIA, Manno-Lugano, Switzerland
  • Volume
    1
  • fYear
    2003
  • fDate
    27-31 Oct. 2003
  • Firstpage
    430
  • Abstract
    It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: (1) reinforcement learning with memory, implemented using an LSTM recurrent neural network whose inputs are discrete events extracted from raw inputs; (2) online exploration and offline policy learning. An experiment with a real robot demonstrates the methodology´s feasibility.
  • Keywords
    discrete event systems; learning (artificial intelligence); recurrent neural nets; robots; discrete events; offline policy learning; online exploration; partial observability; recurrent neural network; reinforcement learning; Actuators; Algorithm design and analysis; Cameras; Data mining; Observability; Recurrent neural networks; Robot control; Robot sensing systems; Robot vision systems; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2003. (IROS 2003). Proceedings. 2003 IEEE/RSJ International Conference on
  • Print_ISBN
    0-7803-7860-1
  • Type

    conf

  • DOI
    10.1109/IROS.2003.1250667
  • Filename
    1250667