مرکز منطقه ای اطلاع رساني علوم و فناوري - Real time targeted exploration in large domains

DocumentCode :

2212496

Title :

Real time targeted exploration in large domains

Author :

Hester, Todd ; Stone, Peter

Author_Institution :

Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA

fYear :

2010

fDate :

18-21 Aug. 2010

Firstpage :

191

Lastpage :

196

Abstract :

A developing agent needs to explore to learn about the world and learn good behaviors. In many real world tasks, this exploration can take far too long, and the agent must make decisions about which states to explore, and which states not to explore. Bayesian methods attempt to address this problem, but take too much computation time to run in reasonably sized domains. In this paper, we present TEXPLORE, the first algorithm to perform targeted exploration in real time in large domains. The algorithm learns multiple possible models of the domain that generalize action effects across states. We experiment with possible ways of adding intrinsic motivation to the agent to drive exploration. TEXPLORE is fully implemented and tested in a novel domain called Fuel World that is designed to reflect the type of targeted exploration needed in the real world. We show that our algorithm significantly outperforms representative examples of both model-free and model-based RL algorithms from the literature and is able to quickly learn to perform well in a large world in real-time.

Keywords :

Bayes methods; decision making; learning (artificial intelligence); mobile agents; real-time systems; Bayesian method; TEXPLORE; agent learning; computational time; fuel world; intrinsic motivation; large domain; real time targeted exploration; reinforcement learning; Bayesian methods; Computational modeling; Decision trees; Fuels; Mathematical model; Prediction algorithms; Predictive models;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Development and Learning (ICDL), 2010 IEEE 9th International Conference on

Conference_Location :

Ann Arbor, MI

Print_ISBN :

978-1-4244-6900-0

Type :

conf

DOI :

10.1109/DEVLRN.2010.5578845

Filename :

5578845

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2212496