• DocumentCode
    3010929
  • Title

    Reinforcement Learning with a Supervisor for a Mobile Robot in a Real-world Environment

  • Author

    Conn, Karla ; Peters, Richard A.

  • Author_Institution
    Vanderbilt Univ., Nashville
  • fYear
    2007
  • fDate
    20-23 June 2007
  • Firstpage
    73
  • Lastpage
    78
  • Abstract
    This paper describes two experiments with supervised reinforcement learning (RL) on a real, mobile robot. Two types of experiments were preformed. One tests the robot´s reliability in implementing a navigation task it has been taught by a supervisor. The other, in which new obstacles are placed along the previously learned path to the goal, measures the robot´s robustness to changes in environment. Supervision consisted of human-guided, remote-controlled runs through a navigation task during the initial stages of reinforcement learning. The RL algorithms deployed enabled the robot to learn a path to a goal yet retain the ability to explore different solutions when confronted with a new obstacle. Experimental analysis was based on measurements of average time to reach the goal, the number of failed states encountered during an episode, and how closely the RL learner matched the supervisor´s actions.
  • Keywords
    control engineering computing; learning (artificial intelligence); mobile robots; navigation; human-guided remote-controlled runs; mobile robot; navigation task; real-world environment; robot reliability; supervised reinforcement learning; Computational intelligence; Computational modeling; Learning; Mobile robots; Navigation; Orbital robotics; Robotics and automation; State-space methods; Testing; USA Councils; Q-learning; Reinforcement learning; mobile robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Robotics and Automation, 2007. CIRA 2007. International Symposium on
  • Conference_Location
    Jacksonville, FI
  • Print_ISBN
    1-4244-0790-7
  • Electronic_ISBN
    1-4244-0790-7
  • Type

    conf

  • DOI
    10.1109/CIRA.2007.382878
  • Filename
    4269878