• DocumentCode
    184706
  • Title

    Planning for large-scale multiagent problems via hierarchical decomposition with applications to UAV health management

  • Author

    Yu Fan Chen ; Ure, N. Kemal ; Chowdhary, Girish ; How, Jonathan P. ; Vian, John

  • Author_Institution
    Dept. of Aeronaut. & Astronaut., MIT, Cambridge, MA, USA
  • fYear
    2014
  • fDate
    4-6 June 2014
  • Firstpage
    1279
  • Lastpage
    1285
  • Abstract
    This paper introduces a novel hierarchical decomposition approach for solving Multiagent Markov Decision Processes (MMDPs) by exploiting coupling relationships in the reward function. MMDP is a natural framework for solving stochastic multi-stage multiagent decision-making problems, such as optimizing mission performance of Unmanned Aerial Vehicles (UAVs) with stochastic health dynamics. However, computing the optimal solutions is often intractable because the state-action spaces scale exponentially with the number of agents. Approximate solution techniques do exist, but they typically rely on extensive domain knowledge. This paper presents the Hierarchically Decomposed MMDP (HD-MMDP) algorithm, which autonomously identifies different degrees of coupling in the reward function and decomposes the MMDP into a hierarchy of smaller MDPs that can be solved separately. Solutions to the smaller MDPs are embedded in an autonomously constructed tree structure to generate an approximate solution to the original problem. Simulation results show HD-MMDP obtains more cumulative reward than that of the existing algorithm for a ten-agent Persistent Search and Track (PST) mission, which is a cooperative multi-UAV mission with more than 1019 states, stochastic fuel consumption model, and health progression model.
  • Keywords
    Markov processes; approximation theory; autonomous aerial vehicles; condition monitoring; decision making; multi-robot systems; path planning; rescue robots; robot dynamics; trees (mathematics); MMDP; UAV health management; cooperative multi UAV mission; coupling relationships; health progression model; hierarchically decomposed MMDP algorithm; large-scale multiagent problems; multiagent Markov decision processes; reward function; stochastic fuel consumption model; stochastic health dynamics; stochastic multistage multiagent decision-making problems; ten-agent persistent search-and-track mission; tree structure; unmanned aerial vehicles; Couplings; Fuels; Heuristic algorithms; Joints; Markov processes; Monitoring; Planning; Fault-tolerant systems; Markov processes; Stochastic systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    American Control Conference (ACC), 2014
  • Conference_Location
    Portland, OR
  • ISSN
    0743-1619
  • Print_ISBN
    978-1-4799-3272-6
  • Type

    conf

  • DOI
    10.1109/ACC.2014.6859242
  • Filename
    6859242