مرکز منطقه ای اطلاع رساني علوم و فناوري - Quasi-online reinforcement learning for robots

DocumentCode :

2103206

Title :

Quasi-online reinforcement learning for robots

Author :

Bakker, Bram ; Zhumatiy, Viktor ; Gruener, Gabriel ; Schmidhuber, Jürgen

Author_Institution :

Informatics Inst., Amsterdam Univ.

fYear :

2006

fDate :

15-19 May 2006

Firstpage :

2997

Lastpage :

3002

Abstract :

This paper describes quasi-online reinforcement learning: while a robot is exploring its environment, in the background a probabilistic model of the environment is built on the fly as new experiences arrive; the policy is trained concurrently based on this model using an anytime algorithm. Prioritized sweeping, directed exploration, and transformed reward functions provide additional speed-ups. The robot quickly learns goal-directed policies from scratch, requiring few interactions with the environment and making efficient use of available computation time. From an outside perspective it learns the behavior online and in real time. We describe comparisons with standard methods and show the individual utility of each of the proposed techniques

Keywords :

learning (artificial intelligence); mobile robots; path planning; probability; directed exploration; prioritized sweeping; probabilistic model; quasi-online reinforcement learning; robots; transformed reward functions; Acceleration; Availability; Concurrent computing; Functional programming; Informatics; Learning; Robot programming; Robot sensing systems; Signal processing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2006. ICRA 2006. Proceedings 2006 IEEE International Conference on

Conference_Location :

Orlando, FL

ISSN :

1050-4729

Print_ISBN :

0-7803-9505-0

Type :

conf

DOI :

10.1109/ROBOT.2006.1642157

Filename :

1642157

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2103206