DocumentCode
2821035
Title
The BAO* algorithm for stochastic shortest path problems with dynamic learning
Author
Aksakalli, Vural
Author_Institution
Johns Hopkins Univ., Baltimore
fYear
2007
fDate
12-14 Dec. 2007
Firstpage
6003
Lastpage
6008
Abstract
Suppose a spatial arrangement of possible obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated en route at a cost. The goal is to find a protocol that decides what and where to disambiguate en route so as to minimize the expected length of the traversal. We call this problem the stochastic shortest path problem with dynamic learning (SDL), which has been shown to be intractable in many broad settings. In this manuscript, we establish a framework for SDL in both continuous and discrete settings and cast the problem as a Markov decision process. The state space, however, is too large to efficiently utilize the stochastic dynamic programming paradigm. We introduce an algorithm for a discretized version of the continuous setting, called the BAO* Algorithm, which is a new improvement on the AO* search algorithm that employs stronger pruning techniques, including utilization of upper bounds on path lengths (in addition to lower bounds as in AO*), and uses significantly less computational resources. The BAO* Algorithm is not polynomial-time, but it can dramatically shorten the execution time needed to find an exact solution to moderately-sized instances of the problem.
Keywords
Markov processes; collision avoidance; dynamic programming; graph theory; search problems; stochastic programming; BAO* algorithm; Markov decision process; dynamic learning; pruning techniques; stochastic dynamic programming; stochastic shortest path problems; Cost function; Delta modulation; Dynamic programming; Polynomials; Protocols; Shortest path problem; State-space methods; Stochastic processes; Upper bound;
fLanguage
English
Publisher
ieee
Conference_Titel
Decision and Control, 2007 46th IEEE Conference on
Conference_Location
New Orleans, LA
ISSN
0191-2216
Print_ISBN
978-1-4244-1497-0
Electronic_ISBN
0191-2216
Type
conf
DOI
10.1109/CDC.2007.4434409
Filename
4434409
Link To Document