Improving tractability of POMDPs by separation of decision and perceptual processes

Author

Fakoor, Rasool ; Huber, Manfred

Author_Institution

Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA

fYear

2012

fDate

14-17 Oct. 2012

Firstpage

593

Lastpage

598

Abstract

Markov Decision Processes (MDPs) and Partially Observable Markov Decision Processes (POMDPs) are very powerful frameworks to model decision and decision learning tasks in a wide range of problem domains. Thus, they are used widely in complex and real-world situations such as robot control tasks. However, this modeling power and generality of the framework comes at a cost in that the complexity of the underlying model and corresponding algorithms grows dramatically as the complexity of the task domain increases. To address this issue in the context of tasks where raw sensory features are used as a basis for complex decision making, this paper presents an integrated and adaptive approach that attempts to reduce the complexity of the decision learning problem by separating the POMDP model into separate decision and perceptual processes. In the proposed framework, a sampling method is used for the perceptual process and reinforcement learning serves to address the decision process. Handling the perceptual and decision processes separately here promises the potential to make it easier to extract relevant perceptual information and concentrate the decision process on relevant state attributes. This, in turn, promises to allow the framework to scale to problems in which traditional POMDP methods are intractable. We show and discuss the effectiveness of our method analytically and empirically.

Keywords

Markov processes; decision making; decision theory; learning (artificial intelligence); sampling methods; POMDP; adaptive approach; complex decision making; complexity reduction; decision learning problem; decision learning task; integrated approach; partially observable Markov decision processes; perceptual process; raw sensory feature; reinforcement learning; robot control task; sampling method; tractability improvement; Atmospheric measurements; Complexity theory; Equations; Particle measurements; Robot kinematics; Uncertainty; Complexity; Decision Process; POMDP; Perceptual Process;

fLanguage

English

Publisher

ieee

Conference_Titel

Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on

Conference_Location

Seoul

Print_ISBN

978-1-4673-1713-9

Electronic_ISBN

978-1-4673-1712-2

Type

conf

DOI

10.1109/ICSMC.2012.6377790

Filename

6377790