• DocumentCode
    1400965
  • Title

    Solving Continuous-State POMDPs via Density Projection

  • Author

    Zhou, Enlu ; Fu, Michael C. ; Marcus, Steven I.

  • Author_Institution
    Dept. of Ind. & Enterprise Syst. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • Volume
    55
  • Issue
    5
  • fYear
    2010
  • fDate
    5/1/2010 12:00:00 AM
  • Firstpage
    1101
  • Lastpage
    1116
  • Abstract
    Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on finite-state models, and these algorithms do not generally extend to continuous-state POMDPs, due to the infinite dimensionality of the belief space. In this paper, we develop a computationally viable and theoretically sound method for solving continuous-state POMDPs by effectively reducing the dimensionality of the belief space via density projection. The density projection technique is also incorporated into particle filtering to provide a filtering scheme for online decision making. We provide an error bound between the value function induced by the policy obtained by our method and the true value function of the POMDP, and also an error bound between projection particle filtering and exact filtering. Finally, we illustrate the effectiveness of our method through an inventory control problem.
  • Keywords
    Markov processes; data reduction; particle filtering (numerical methods); belief space; continuous-state POMDP; density projection; dimensionality reduction; error bound; finite state model; infinite dimensionality; numerical solution; online decision making; partially observable Markov decision process; particle filtering; true value function; Aerospace industry; Cost function; Decision making; Educational institutions; Filtering; History; Inventory control; Probability distribution; Sampling methods; State-space methods; Systems engineering and theory; Uncertainty; Belief state; decision making; density projection; partially observable Markov decision processes (POMDPs); particle filtering; value function;
  • fLanguage
    English
  • Journal_Title
    Automatic Control, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9286
  • Type

    jour

  • DOI
    10.1109/TAC.2010.2042005
  • Filename
    5404339