• DocumentCode
    2863217
  • Title

    Decomposing large-scale POMDP via belief state analysis

  • Author

    Li, Xin ; Cheung, William K. ; Liu, Jiming

  • Author_Institution
    Dept. of Comput. Sci., Hong Kong Baptist Univ., China
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    428
  • Lastpage
    434
  • Abstract
    Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing the optimal policy for a large-scale POMDP is known to be intractable. Belief compression, being an approximate solution, has recently been proposed to reduce the dimension of POMDP´s belief state space and shown to be effective in improving the problem tractability. In this paper, with the conjecture that temporally close belief states could be characterized by a lower intrinsic dimension, we propose a spatio-temporal brief clustering that considers both the belief states´ spatial (in the belief space) and temporal similarities, as well as incorporate it into the belief compression algorithm. The proposed clustering results in belief state clusters as sub-POMDPs of much lower dimension so as to be distributed to a set of distributed agents for collaborative problem solving. The proposed method has been tested using a synthesized navigation problem (Hallway2) and empirically shown to be able to result in policies of superior long-term rewards when compared with those based on solely belief compression. Some future research directions for extending this belief state analysis approach are also included.
  • Keywords
    Markov processes; belief maintenance; decision making; multi-agent systems; problem solving; belief compression; belief state analysis; collaborative problem solving; distributed agent; optimal decision making; partially observable Markov decision process; spatio-temporal brief clustering; stochastic environment; synthesized navigation problem; unobservable state; Collaborative work; Compression algorithms; Computer science; Decision making; History; Large-scale systems; Principal component analysis; State-space methods; Stochastic processes; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Agent Technology, IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2416-8
  • Type

    conf

  • DOI
    10.1109/IAT.2005.63
  • Filename
    1565576