• DocumentCode
    2415715
  • Title

    RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for robot control

  • Author

    Hester, Todd ; Quinlan, Michael ; Stone, Peter

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
  • fYear
    2012
  • fDate
    14-18 May 2012
  • Firstpage
    85
  • Lastpage
    90
  • Abstract
    Reinforcement Learning (RL) is a paradigm for learning decision-making tasks that could enable robots to learn and adapt to their situation on-line. For an RL algorithm to be practical for robotic control tasks, it must learn in very few samples, while continually taking actions in real-time. Existing model-based RL methods learn in relatively few samples, but typically take too much time between each action for practical on-line learning. In this paper, we present a novel parallel architecture for model-based RL that runs in real-time by 1) taking advantage of sample-based approximate planning methods and 2) parallelizing the acting, model learning, and planning processes in a novel way such that the acting process is sufficiently fast for typical robot control cycles. We demonstrate that algorithms using this architecture perform nearly as well as methods using the typical sequential architecture when both are given unlimited time, and greatly out-perform these methods on tasks that require real-time actions such as controlling an autonomous vehicle.
  • Keywords
    approximation theory; control engineering computing; decision making; learning (artificial intelligence); parallel architectures; real-time systems; robots; RL; RTMBA; autonomous vehicle; decision-making learning; parallel architecture; planning processes; real-time model-based reinforcement learning architecture; robot control; sample based approximate planning methods; Approximation algorithms; Computational modeling; Multicore processing; Planning; Real time systems; Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation (ICRA), 2012 IEEE International Conference on
  • Conference_Location
    Saint Paul, MN
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4673-1403-9
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ICRA.2012.6225072
  • Filename
    6225072