DocumentCode
417047
Title
Self-Segmentation of Sequences algorithm using eligibility traces in POMDP environments
Author
Kamaya, H. ; Abe, K.
Author_Institution
Dept. of Electr. Eng., Hachinohe Nat. Coll. of Technol., Japan
Volume
2
fYear
2003
fDate
4-6 Aug. 2003
Firstpage
2010
Abstract
This paper presents a new hierarchical reinforcement learning (RL) algorithm to speed up learning, to make it more robust to hidden states, and to handle non-deterministic problems. This on-line RL algorithm is called SSS(/spl lambda/), which extended Sun and Sessions´s original Self-Segmentation of Sequences (SSS) algorithm using eligibility traces. SSS(/spl lambda/) is compared with the original SSS for partially observable navigation tasks. The results of extensive simulations demonstrate that SSS(/spl lambda/) is clearly outperforming the original SSS.
Keywords
Markov processes; learning (artificial intelligence); sequences; eligibility traces; hidden states; hierarchical reinforcement learning algorithm; nondeterministic problems; partially observable Markov decision processes environments; partially observable navigation; self segmentation; sequence algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
SICE 2003 Annual Conference
Conference_Location
Fukui, Japan
Print_ISBN
0-7803-8352-4
Type
conf
Filename
1324290
Link To Document