• DocumentCode
    2821035
  • Title

    The BAO* algorithm for stochastic shortest path problems with dynamic learning

  • Author

    Aksakalli, Vural

  • Author_Institution
    Johns Hopkins Univ., Baltimore
  • fYear
    2007
  • fDate
    12-14 Dec. 2007
  • Firstpage
    6003
  • Lastpage
    6008
  • Abstract
    Suppose a spatial arrangement of possible obstacles needs to be traversed as swiftly as possible, and the status of the obstacles may be disambiguated en route at a cost. The goal is to find a protocol that decides what and where to disambiguate en route so as to minimize the expected length of the traversal. We call this problem the stochastic shortest path problem with dynamic learning (SDL), which has been shown to be intractable in many broad settings. In this manuscript, we establish a framework for SDL in both continuous and discrete settings and cast the problem as a Markov decision process. The state space, however, is too large to efficiently utilize the stochastic dynamic programming paradigm. We introduce an algorithm for a discretized version of the continuous setting, called the BAO* Algorithm, which is a new improvement on the AO* search algorithm that employs stronger pruning techniques, including utilization of upper bounds on path lengths (in addition to lower bounds as in AO*), and uses significantly less computational resources. The BAO* Algorithm is not polynomial-time, but it can dramatically shorten the execution time needed to find an exact solution to moderately-sized instances of the problem.
  • Keywords
    Markov processes; collision avoidance; dynamic programming; graph theory; search problems; stochastic programming; BAO* algorithm; Markov decision process; dynamic learning; pruning techniques; stochastic dynamic programming; stochastic shortest path problems; Cost function; Delta modulation; Dynamic programming; Polynomials; Protocols; Shortest path problem; State-space methods; Stochastic processes; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control, 2007 46th IEEE Conference on
  • Conference_Location
    New Orleans, LA
  • ISSN
    0191-2216
  • Print_ISBN
    978-1-4244-1497-0
  • Electronic_ISBN
    0191-2216
  • Type

    conf

  • DOI
    10.1109/CDC.2007.4434409
  • Filename
    4434409