مرکز منطقه ای اطلاع رساني علوم و فناوري - Self-Segmentation of Sequences algorithm using eligibility traces in POMDP environments

DocumentCode :

417047

Title :

Self-Segmentation of Sequences algorithm using eligibility traces in POMDP environments

Author :

Kamaya, H. ; Abe, K.

Author_Institution :

Dept. of Electr. Eng., Hachinohe Nat. Coll. of Technol., Japan

Volume :

fYear :

2003

fDate :

4-6 Aug. 2003

Firstpage :

2010

Abstract :

This paper presents a new hierarchical reinforcement learning (RL) algorithm to speed up learning, to make it more robust to hidden states, and to handle non-deterministic problems. This on-line RL algorithm is called SSS(/spl lambda/), which extended Sun and Sessions´s original Self-Segmentation of Sequences (SSS) algorithm using eligibility traces. SSS(/spl lambda/) is compared with the original SSS for partially observable navigation tasks. The results of extensive simulations demonstrate that SSS(/spl lambda/) is clearly outperforming the original SSS.

Keywords :

Markov processes; learning (artificial intelligence); sequences; eligibility traces; hidden states; hierarchical reinforcement learning algorithm; nondeterministic problems; partially observable Markov decision processes environments; partially observable navigation; self segmentation; sequence algorithm;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

SICE 2003 Annual Conference

Conference_Location :

Fukui, Japan

Print_ISBN :

0-7803-8352-4

Type :

conf

Filename :

1324290

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=417047