• DocumentCode
    921473
  • Title

    Hidden state and reinforcement learning with instance-based state identification

  • Author

    McCallum, R. Andrew

  • Author_Institution
    Dept. of Comput. Sci., Rochester Univ., NY, USA
  • Volume
    26
  • Issue
    3
  • fYear
    1996
  • fDate
    6/1/1996 12:00:00 AM
  • Firstpage
    464
  • Lastpage
    473
  • Abstract
    Real robots with real sensors are not omniscient. When a robot´s next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded field of view and limited attention, we say the robot suffers from the hidden state problem. State identification techniques use history information to uncover hidden state. Some previous approaches to encoding history include: finite state machines, recurrent neural networks and genetic programming with indexed memory. A chief disadvantage of all these techniques is their long training time. This paper presents instance-based state identification, a new approach to reinforcement learning with state identification that learns with much fewer training steps. Noting that learning with history and learning in continuous spaces both share the property that they begin without knowing the granularity of the state space, the approach applies instance-based (or “memory-based”) learning to history sequences-instead of recording instances in a continuous geometrical space, we record instances in action-percept-reward sequence space. The first implementation of this approach, called Nearest Sequence Memory, learns with an order of magnitude fewer steps than several previous approaches
  • Keywords
    identification; learning (artificial intelligence); robots; Nearest Sequence Memory; action-percept-reward sequence space; instance-based state identification; reinforcement learning; robots; state identification; training steps; Automata; Elevators; Encoding; History; Hospitals; Learning; Orbital robotics; Robot sensing systems; Sensor phenomena and characterization; Transceivers;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/3477.499796
  • Filename
    499796