Title :
Decomposing large-scale POMDP via belief state analysis
Author :
Li, Xin ; Cheung, William K. ; Liu, Jiming
Author_Institution :
Dept. of Comput. Sci., Hong Kong Baptist Univ., China
Abstract :
Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing the optimal policy for a large-scale POMDP is known to be intractable. Belief compression, being an approximate solution, has recently been proposed to reduce the dimension of POMDP´s belief state space and shown to be effective in improving the problem tractability. In this paper, with the conjecture that temporally close belief states could be characterized by a lower intrinsic dimension, we propose a spatio-temporal brief clustering that considers both the belief states´ spatial (in the belief space) and temporal similarities, as well as incorporate it into the belief compression algorithm. The proposed clustering results in belief state clusters as sub-POMDPs of much lower dimension so as to be distributed to a set of distributed agents for collaborative problem solving. The proposed method has been tested using a synthesized navigation problem (Hallway2) and empirically shown to be able to result in policies of superior long-term rewards when compared with those based on solely belief compression. Some future research directions for extending this belief state analysis approach are also included.
Keywords :
Markov processes; belief maintenance; decision making; multi-agent systems; problem solving; belief compression; belief state analysis; collaborative problem solving; distributed agent; optimal decision making; partially observable Markov decision process; spatio-temporal brief clustering; stochastic environment; synthesized navigation problem; unobservable state; Collaborative work; Compression algorithms; Computer science; Decision making; History; Large-scale systems; Principal component analysis; State-space methods; Stochastic processes; Testing;
Conference_Titel :
Intelligent Agent Technology, IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2416-8
DOI :
10.1109/IAT.2005.63