DocumentCode :
1873663
Title :
An approximate algorithm for solving oracular POMDPs
Author :
Armstrong-Crews, Nicholas ; Veloso, Manuela
Author_Institution :
Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA
fYear :
2008
fDate :
19-23 May 2008
Firstpage :
3346
Lastpage :
3352
Abstract :
We propose a new approximate algorithm, LA- JIV (lookahead J-MDP information value), to solve oracular partially observable Markov decision problems (OPOMDPs), a special type of POMDP that rather than standard observations includes an "oracle" that can be consulted for full state information at a fixed cost. We previously introduced JIV (J-MDP information value) to solve OPOMDPs, an heuristic algorithm that utilizes the solution of the underlying MDP and weighs the value of consulting the oracle against the value of taking a state-modifying action. While efficient, JIV will rarely find the optimal solution. In this paper, we extend JIV to include lookahead, thereby permitting arbitrarily small deviation from the optimal policy\´s long-term expected reward at the cost of added computation time. The depth of the lookahead is a parameter that governs this tradeoff; by iteratively increasing this depth, we provide an anytime algorithm that yields an ever- improving solution. LA-JIV leverages the OPOMDP framework\´s unique characteristics to outperform general-purpose approximate POMDP solvers; in fact, we prove that LA-JIV is a poly-time approximation scheme (PTAS) with respect to the size of the state and observation spaces, thereby showing rigorously that OPOMDPs are "easier" than POMDPs. Finally, we substantiate our theoretical results via an empirical analysis of a benchmark OPOMDP instance.
Keywords :
Markov processes; approximation theory; decision making; intelligent robots; heuristic algorithm; oracular partially observable Markov decision problems; polytime approximation scheme; robots; time-varying stochastic process; Computer science; Costs; Heuristic algorithms; Medical robotics; Observability; Orbital robotics; Robot sensing systems; Robotics and automation; USA Councils; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Robotics and Automation, 2008. ICRA 2008. IEEE International Conference on
Conference_Location :
Pasadena, CA
ISSN :
1050-4729
Print_ISBN :
978-1-4244-1646-2
Electronic_ISBN :
1050-4729
Type :
conf
DOI :
10.1109/ROBOT.2008.4543721
Filename :
4543721
Link To Document :
بازگشت