• DocumentCode
    299852
  • Title

    Vision-based reinforcement learning for purposive behavior acquisition

  • Author

    Asada, Minoru ; Noda, Shoichi ; Tawaratsumida, Sukoya ; Hosoda, Koh

  • Author_Institution
    Dept. of Mech. Eng. for Comput.-Controlled Machinery, Osaka Univ., Japan
  • Volume
    1
  • fYear
    1995
  • fDate
    21-27 May 1995
  • Firstpage
    146
  • Abstract
    This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement learning method to a real robot with vision sensor. First, a “state-action deviation” problem is found as a form of perceptual aliasing in constructing the state and action spaces that reflect the outputs from physical sensors and actuators, respectively. To cope with this, an action set is constructed in such a way that one action consists of a series of the same action primitive which is successively executed until the current state changes. Next, to speed up the learning time, a mechanism of learning form easy missions (or LEM) which is a similar technique to “shaping” in animal learning is implemented. LEM reduces the learning time from the exponential order in the size of the state space to about the linear order in the size of the state space. The results of computer simulations and real robot experiments are given
  • Keywords
    digital simulation; learning (artificial intelligence); robot programming; robot vision; action primitive; action set; learning form easy missions; perceptual aliasing; purposive behavior acquisition; state-action deviation problem; vision sensor; vision-based reinforcement learning; Actuators; Computer vision; Machine learning; Mobile robots; Orbital robotics; Robot control; Robot sensing systems; Robot vision systems; Robotics and automation; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 1995. Proceedings., 1995 IEEE International Conference on
  • Conference_Location
    Nagoya
  • ISSN
    1050-4729
  • Print_ISBN
    0-7803-1965-6
  • Type

    conf

  • DOI
    10.1109/ROBOT.1995.525277
  • Filename
    525277