مرکز منطقه ای اطلاع رساني علوم و فناوري - Integrating Value-Directed Compression and Belief Space Analysis for POMDP Decomposition

DocumentCode :

3102143

Title :

Integrating Value-Directed Compression and Belief Space Analysis for POMDP Decomposition

Author :

Li, Xin ; Cheung, William K. ; Liu, Jiming

Author_Institution :

Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong

fYear :

2006

fDate :

18-22 Dec. 2006

Firstpage :

Lastpage :

Abstract :

Partially observable Markov decision process (POMDP) is a commonly adopted framework to model planning problems for agents to act in a stochastic environment. Obtaining the optimal policy of POMDP for large-scale problems is known to be intractable, where the high dimension of its belief state is one of the major causes. The use of the compression approach has recently been shown to be promising in tackling the curse of dimensionality problem. In this paper, a novel value-directed belief compression technique is proposed,together with clustering of belief states for further reducing the underlying computational complexity. We first cluster some sampled belief states into disjoint partitions and then apply a non-negative matrix factorization (NMF) based projection to each belief state cluster for dimension reduction. We then compute the optimal policy is then computed using a pointed-based value iteration algorithm defined in the low-dimensional projected belief state space. The proposed algorithm has been evaluated using a synthesized navigation problem. Solutions with quality comparable to the original POMDP were obtained at a much lower computational cost.

Keywords :

Markov processes; belief networks; data compression; data reduction; decision theory; iterative methods; matrix decomposition; multi-agent systems; planning (artificial intelligence); POMDP decomposition; belief space analysis; belief state cluster; computational complexity; dimension reduction; intelligent agent; model planning problem; nonnegative matrix factorization; partially observable Markov decision process; pointed-based value iteration algorithm; stochastic environment; value-directed belief compression; Clustering algorithms; Computational efficiency; Computer science; History; Intelligent agent; Large-scale systems; Partitioning algorithms; Probability distribution; State-space methods; Stochastic processes;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Agent Technology, 2006. IAT '06. IEEE/WIC/ACM International Conference on

Conference_Location :

Hong Kong

Print_ISBN :

0-7695-2748-5

Type :

conf

DOI :

10.1109/IAT.2006.81

Filename :

4052897

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3102143