مرکز منطقه ای اطلاع رساني علوم و فناوري - Using Learning Classifier Systems to Learn Stochastic Decision Policies

DocumentCode :

3601650

Title :

Using Learning Classifier Systems to Learn Stochastic Decision Policies

Author :

Gang Chen ; Douch, Colin I. J. ; Mengjie Zhang

Author_Institution :

Sch. of Eng. & Comput. Sci., Victoria Univ. of Wellington, Wellington, New Zealand

Volume :

Issue :

fYear :

2015

Firstpage :

885

Lastpage :

902

Abstract :

To solve reinforcement learning problems, many learning classifier systems (LCSs) are designed to learn state-action value functions through a compact set of maximally general and accurate rules. Most of these systems focus primarily on learning deterministic policies by using a greedy action selection strategy. However, in practice, it may be more flexible and desirable to learn stochastic policies, which can be considered as direct extensions of their deterministic counterparts. In this paper, we aim to achieve this goal by extending each rule with a new policy parameter. Meanwhile, a new method for adaptive learning of stochastic action selection strategies based on a policy gradient framework has also been introduced. Using this method, we have developed two new learning systems, one based on a regular gradient learning technology and the other based on a new natural gradient learning method. Both learning systems have been evaluated on three different types of reinforcement learning problems. The promising performance of the two systems clearly shows that LCSs provide a suitable platform for efficient and reliable learning of stochastic policies.

Keywords :

gradient methods; learning (artificial intelligence); pattern classification; stochastic processes; LCS; deterministic policy learning; greedy action selection strategy; learning classifier system; natural gradient learning method; policy gradient framework; regular gradient learning technology; reinforcement learning problems; state-action value functions; stochastic decision policy learning; Gradient methods; Learning (artificial intelligence); Learning systems; Reliability; Sociology; Statistics; Stochastic processes; Gradient methods; Learning systems; Stochastic systems; learning systems; stochastic systems;

fLanguage :

English

Journal_Title :

Evolutionary Computation, IEEE Transactions on

Publisher :

ieee

ISSN :

1089-778X

Type :

jour

DOI :

10.1109/TEVC.2015.2415464

Filename :

7065281

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3601650