DocumentCode
1400965
Title
Solving Continuous-State POMDPs via Density Projection
Author
Zhou, Enlu ; Fu, Michael C. ; Marcus, Steven I.
Author_Institution
Dept. of Ind. & Enterprise Syst. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
Volume
55
Issue
5
fYear
2010
fDate
5/1/2010 12:00:00 AM
Firstpage
1101
Lastpage
1116
Abstract
Research on numerical solution methods for partially observable Markov decision processes (POMDPs) has primarily focused on finite-state models, and these algorithms do not generally extend to continuous-state POMDPs, due to the infinite dimensionality of the belief space. In this paper, we develop a computationally viable and theoretically sound method for solving continuous-state POMDPs by effectively reducing the dimensionality of the belief space via density projection. The density projection technique is also incorporated into particle filtering to provide a filtering scheme for online decision making. We provide an error bound between the value function induced by the policy obtained by our method and the true value function of the POMDP, and also an error bound between projection particle filtering and exact filtering. Finally, we illustrate the effectiveness of our method through an inventory control problem.
Keywords
Markov processes; data reduction; particle filtering (numerical methods); belief space; continuous-state POMDP; density projection; dimensionality reduction; error bound; finite state model; infinite dimensionality; numerical solution; online decision making; partially observable Markov decision process; particle filtering; true value function; Aerospace industry; Cost function; Decision making; Educational institutions; Filtering; History; Inventory control; Probability distribution; Sampling methods; State-space methods; Systems engineering and theory; Uncertainty; Belief state; decision making; density projection; partially observable Markov decision processes (POMDPs); particle filtering; value function;
fLanguage
English
Journal_Title
Automatic Control, IEEE Transactions on
Publisher
ieee
ISSN
0018-9286
Type
jour
DOI
10.1109/TAC.2010.2042005
Filename
5404339
Link To Document