Decomposing large-scale POMDP via belief state analysis

Author

Li, Xin ; Cheung, William K. ; Liu, Jiming

Author_Institution

Dept. of Comput. Sci., Hong Kong Baptist Univ., China

fYear

2005

fDate

19-22 Sept. 2005

Firstpage

428

Lastpage

434

Abstract

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing the optimal policy for a large-scale POMDP is known to be intractable. Belief compression, being an approximate solution, has recently been proposed to reduce the dimension of POMDP´s belief state space and shown to be effective in improving the problem tractability. In this paper, with the conjecture that temporally close belief states could be characterized by a lower intrinsic dimension, we propose a spatio-temporal brief clustering that considers both the belief states´ spatial (in the belief space) and temporal similarities, as well as incorporate it into the belief compression algorithm. The proposed clustering results in belief state clusters as sub-POMDPs of much lower dimension so as to be distributed to a set of distributed agents for collaborative problem solving. The proposed method has been tested using a synthesized navigation problem (Hallway2) and empirically shown to be able to result in policies of superior long-term rewards when compared with those based on solely belief compression. Some future research directions for extending this belief state analysis approach are also included.

Keywords

Markov processes; belief maintenance; decision making; multi-agent systems; problem solving; belief compression; belief state analysis; collaborative problem solving; distributed agent; optimal decision making; partially observable Markov decision process; spatio-temporal brief clustering; stochastic environment; synthesized navigation problem; unobservable state; Collaborative work; Compression algorithms; Computer science; Decision making; History; Large-scale systems; Principal component analysis; State-space methods; Stochastic processes; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Agent Technology, IEEE/WIC/ACM International Conference on

Print_ISBN

0-7695-2416-8

Type

conf

DOI

10.1109/IAT.2005.63

Filename

1565576