• DocumentCode
    354247
  • Title

    Research on reinforcement learning of the intelligent robot based on self-adaptive quantization

  • Author

    Rubo, ZHANG ; Yu, Sun ; Wang Xingoe ; Guangmin, Yang ; Guochang, Gu

  • Author_Institution
    Harbin Eng. Univ., China
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1226
  • Abstract
    The concept of the reinforcement learning comes from behavior psychology that takes behavior learning as trial and error, by which the states of the environment are mapped into corresponding actions. There is a question of how can the behaviourism be used to learn the actions in interaction with the environment in designing an intelligent robot. In the paper, the actions that the robot takes to avoid obstacles are taken as one class of behaviors and the reinforcement learning is used to realize behavior learning of obstacle avoidance. The quantization of the state space is very important in improving the robot´s learning speed. The SOM neural network is adopted to get self-adaptive quantization of the state space. The self-organization characteristic of the SOM neural network makes it possible to solve the adaptation problem and is flexible in space quantization. The reinforcement learning is used to settle the robot learning of collision avoidance behavior based on quantization of the state space and satisfying results are obtained
  • Keywords
    learning (artificial intelligence); robots; self-organising feature maps; state-space methods; SOM neural network; adaptation; avoidance; behavior learning; behavior psychology; behaviourism; intelligent robot; learning speed; obstacles avoidance; reinforcement learning; self-adaptive quantization; state space quantization; Intelligent robots; Learning; Quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2000. Proceedings of the 3rd World Congress on
  • Conference_Location
    Hefei
  • Print_ISBN
    0-7803-5995-X
  • Type

    conf

  • DOI
    10.1109/WCICA.2000.863438
  • Filename
    863438