Title :
Consolidated actor-critic model for partially-observable Markov decision processes
Author :
Elhanany, I. ; Niedzwiedz, C. ; Liu, Zhe ; Livingston, S.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Univ. of Tennessee, Knoxville, TN
Abstract :
A method for consolidating the traditionally separate actor and critic neural networks in temporal difference learning for addressing partially-observable Markov decision processes (POMDPs) is presented. Simulation results for solving a five-state POMDP problem support the claim that the consolidated model achieves higher performance while reducing computational and storage requirements to approximately half those of the traditional approach.
Keywords :
Markov processes; decision theory; Markov decision processes; actor-critic model; critic neural networks; temporal difference learning; traditionally separate actor
Journal_Title :
Electronics Letters
DOI :
10.1049/el:20081346