مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2005955

Title :

Classifier-Based Policy Representation

Author :

Rexakis, Ioannis ; Lagoudakis, Michail G.

Author_Institution :

Dept. of Electron. & Comput. Eng., Tech. Univ. of Crete, Chania

fYear :

2008

fDate :

11-13 Dec. 2008

Firstpage :

Lastpage :

Abstract :

Motivated by recent proposals that view a reinforcement learning problem as a collection of classification problems, we investigate various aspects of policy representation using classifiers. In particular, we derive optimal policies for two standard reinforcement learning domains (inverted pendulum and mountain car) in both deterministic and stochastic versions and we examine their internal structure. We then proceed in an evaluation of the representational ability of a variety of classifiers for these policies, using both a multi-class and a binary formulation of the classification problem. Finally, we evaluate the actual performance of the policies learned by the classifiers in the original control problem as a function of the amount of training examples provided. Our results offer significant insight in making the reinforcement-learning-via-classification technology successfully applicable to hard learning problems.

Keywords :

learning (artificial intelligence); pattern classification; classifier-based policy representation; inverted pendulum; mountain car; reinforcement-learning-via-classification technology; Application software; Intelligent systems; Laboratories; Learning systems; Machine learning; Proposals; Space technology; State-space methods; Stochastic processes; Training data; Classification; Policy Representation; Reinforcement Learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on

Conference_Location :

San Diego, CA

Print_ISBN :

978-0-7695-3495-4

Type :

conf

DOI :

10.1109/ICMLA.2008.31

Filename :

4724960

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2005955