DocumentCode :
964489
Title :
Consolidated actor-critic model for partially-observable Markov decision processes
Author :
Elhanany, I. ; Niedzwiedz, C. ; Liu, Zhe ; Livingston, S.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN
Volume :
44
Issue :
22
fYear :
2008
Firstpage :
1317
Lastpage :
1318
Abstract :
A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.
Keywords :
Markov processes; decision theory; Markov decision processes; actor-critic model; critic neural networks; temporal difference learning; traditionally separate actor;
fLanguage :
English
Journal_Title :
Electronics Letters
Publisher :
iet
ISSN :
0013-5194
Type :
jour
DOI :
10.1049/el:20081346
Filename :
4658763
Link To Document :
بازگشت