• DocumentCode
    493367
  • Title

    Executing concurrent actions with multiple Markov decision processes

  • Author

    Corona-Xelhuantzi, Elva ; Morales, Eduardo F. ; Sucar, Enrique

  • Author_Institution
    Dept. of Comput. Sci., Nat. Inst. of Astrophys., Opt. & Electron., Puebla
  • fYear
    2009
  • fDate
    March 30 2009-April 2 2009
  • Firstpage
    82
  • Lastpage
    89
  • Abstract
    Markov decision processes (MDPs) have become a standard method for planning under uncertainty, however they usually assume a sequential process, so a single action is executed at each time step. In some applications, as in robotics, it is required to execute several actions concurrently. For this we propose a framework based on a functional decomposition of the problem into several sub-problems, each represented as a subMDP. Each subMDP is solved independently and their policies are combined to obtain a global solution, such that the actions of each subMDP can be executed concurrently. As we combine the local policies, conflicts between them can arise. We define two kinds of conflicts, resource and behavior conflicts, and propose solutions for both. Resource conflicts are solved off-line via a two-phase process which guarantees a near-optimal global policy. Behavior conflicts are solved on-line based on a set of restrictions specified by the user. If there are no restrictions, all the actions are executed concurrently; otherwise, an arbiter selects the action(s) with higher expected utility. We present experimental results in two cases: (i) a simulated robot navigation problem, with resource conflicts, and (ii) a simulated robot in a message delivery task, with behavior conflicts.
  • Keywords
    Markov processes; path planning; robots; behavior conflicts; concurrent actions; functional decomposition; message delivery task; multiple Markov decision processes; resource conflicts; robot navigation; Astrophysics; Computational modeling; Computer science; Human robot interaction; Mobile robots; Navigation; Orbital robotics; Process planning; Robotics and automation; Utility theory; Concurrent Actions; Markov Decision Processes; Service Robots;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL '09. IEEE Symposium on
  • Conference_Location
    Nashville, TN
  • Print_ISBN
    978-1-4244-2761-1
  • Type

    conf

  • DOI
    10.1109/ADPRL.2009.4927529
  • Filename
    4927529