DocumentCode :
417047
Title :
Self-Segmentation of Sequences algorithm using eligibility traces in POMDP environments
Author :
Kamaya, H. ; Abe, K.
Author_Institution :
Dept. of Electr. Eng., Hachinohe Nat. Coll. of Technol., Japan
Volume :
2
fYear :
2003
fDate :
4-6 Aug. 2003
Firstpage :
2010
Abstract :
This paper presents a new hierarchical reinforcement learning (RL) algorithm to speed up learning, to make it more robust to hidden states, and to handle non-deterministic problems. This on-line RL algorithm is called SSS(/spl lambda/), which extended Sun and Sessions´s original Self-Segmentation of Sequences (SSS) algorithm using eligibility traces. SSS(/spl lambda/) is compared with the original SSS for partially observable navigation tasks. The results of extensive simulations demonstrate that SSS(/spl lambda/) is clearly outperforming the original SSS.
Keywords :
Markov processes; learning (artificial intelligence); sequences; eligibility traces; hidden states; hierarchical reinforcement learning algorithm; nondeterministic problems; partially observable Markov decision processes environments; partially observable navigation; self segmentation; sequence algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SICE 2003 Annual Conference
Conference_Location :
Fukui, Japan
Print_ISBN :
0-7803-8352-4
Type :
conf
Filename :
1324290
Link To Document :
بازگشت