DocumentCode
2465511
Title
Improving tractability of POMDPs by separation of decision and perceptual processes
Author
Fakoor, Rasool ; Huber, Manfred
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
fYear
2012
fDate
14-17 Oct. 2012
Firstpage
593
Lastpage
598
Abstract
Markov Decision Processes (MDPs) and Partially Observable Markov Decision Processes (POMDPs) are very powerful frameworks to model decision and decision learning tasks in a wide range of problem domains. Thus, they are used widely in complex and real-world situations such as robot control tasks. However, this modeling power and generality of the framework comes at a cost in that the complexity of the underlying model and corresponding algorithms grows dramatically as the complexity of the task domain increases. To address this issue in the context of tasks where raw sensory features are used as a basis for complex decision making, this paper presents an integrated and adaptive approach that attempts to reduce the complexity of the decision learning problem by separating the POMDP model into separate decision and perceptual processes. In the proposed framework, a sampling method is used for the perceptual process and reinforcement learning serves to address the decision process. Handling the perceptual and decision processes separately here promises the potential to make it easier to extract relevant perceptual information and concentrate the decision process on relevant state attributes. This, in turn, promises to allow the framework to scale to problems in which traditional POMDP methods are intractable. We show and discuss the effectiveness of our method analytically and empirically.
Keywords
Markov processes; decision making; decision theory; learning (artificial intelligence); sampling methods; POMDP; adaptive approach; complex decision making; complexity reduction; decision learning problem; decision learning task; integrated approach; partially observable Markov decision processes; perceptual process; raw sensory feature; reinforcement learning; robot control task; sampling method; tractability improvement; Atmospheric measurements; Complexity theory; Equations; Particle measurements; Robot kinematics; Uncertainty; Complexity; Decision Process; POMDP; Perceptual Process;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4673-1713-9
Electronic_ISBN
978-1-4673-1712-2
Type
conf
DOI
10.1109/ICSMC.2012.6377790
Filename
6377790
Link To Document