• DocumentCode
    2465511
  • Title

    Improving tractability of POMDPs by separation of decision and perceptual processes

  • Author

    Fakoor, Rasool ; Huber, Manfred

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
  • fYear
    2012
  • fDate
    14-17 Oct. 2012
  • Firstpage
    593
  • Lastpage
    598
  • Abstract
    Markov Decision Processes (MDPs) and Partially Observable Markov Decision Processes (POMDPs) are very powerful frameworks to model decision and decision learning tasks in a wide range of problem domains. Thus, they are used widely in complex and real-world situations such as robot control tasks. However, this modeling power and generality of the framework comes at a cost in that the complexity of the underlying model and corresponding algorithms grows dramatically as the complexity of the task domain increases. To address this issue in the context of tasks where raw sensory features are used as a basis for complex decision making, this paper presents an integrated and adaptive approach that attempts to reduce the complexity of the decision learning problem by separating the POMDP model into separate decision and perceptual processes. In the proposed framework, a sampling method is used for the perceptual process and reinforcement learning serves to address the decision process. Handling the perceptual and decision processes separately here promises the potential to make it easier to extract relevant perceptual information and concentrate the decision process on relevant state attributes. This, in turn, promises to allow the framework to scale to problems in which traditional POMDP methods are intractable. We show and discuss the effectiveness of our method analytically and empirically.
  • Keywords
    Markov processes; decision making; decision theory; learning (artificial intelligence); sampling methods; POMDP; adaptive approach; complex decision making; complexity reduction; decision learning problem; decision learning task; integrated approach; partially observable Markov decision processes; perceptual process; raw sensory feature; reinforcement learning; robot control task; sampling method; tractability improvement; Atmospheric measurements; Complexity theory; Equations; Particle measurements; Robot kinematics; Uncertainty; Complexity; Decision Process; POMDP; Perceptual Process;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-1713-9
  • Electronic_ISBN
    978-1-4673-1712-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2012.6377790
  • Filename
    6377790