• DocumentCode
    3600966
  • Title

    Energy Efficient Execution of POMDP Policies

  • Author

    Grzes, Marek ; Poupart, Pascal ; Xiao Yang ; Hoey, Jesse

  • Author_Institution
    David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
  • Volume
    45
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2484
  • Lastpage
    2497
  • Abstract
    Recent advances in planning techniques for partially observable Markov decision processes (POMDPs) have focused on online search techniques and offline point-based value iteration. While these techniques allow practitioners to obtain policies for fairly large problems, they assume that a nonnegligible amount of computation can be done between each decision point. In contrast, the recent proliferation of mobile and embedded devices has lead to a surge of applications that could benefit from state-of-the-art planning techniques if they can operate under severe constraints on computational resources. To that effect, we describe two techniques to compile policies into controllers that can be executed by a mere table lookup at each decision point. The first approach compiles policies induced by a set of alpha vectors (such as those obtained by point-based techniques) into approximately equivalent controllers, while the second approach performs a simulation to compile arbitrary policies into approximately equivalent controllers. We also describe an approach to compress controllers by removing redundant and dominated nodes, often yielding smaller and yet better controllers. Further compression and higher value can sometimes be obtained by considering stochastic controllers. The compilation and compression techniques are demonstrated on benchmark problems as well as a mobile application to help persons with Alzheimer´s to way-find. The battery consumption of several POMDP policies is compared against finite-state controllers learned using methods introduced in this paper. Experiments performed on the Nexus 4 phone show that finite-state controllers are the least battery consuming POMDP policies.
  • Keywords
    Markov processes; benchmark testing; decision making; finite state machines; iterative methods; mobile handsets; observability; power aware computing; stochastic systems; table lookup; Nexus 4 phone; POMDP policies; alpha vectors; battery consumption; compilation techniques; compression techniques; computational resources; decision point; embedded devices; energy efficient execution; finite-state controllers; mobile devices; offline point-based value iteration; online search techniques; partially observable Markov decision process; point-based techniques; stochastic controllers; table lookup; Approximation algorithms; History; Indexes; Mobile communication; Planning; Power capacitors; Vectors; Energy-efficiency; Markov decision processes; finite-state controllers; knowledge compilation; mobile applications; partially observable Markov decision processes (POMDPs);
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2375817
  • Filename
    6985527