DocumentCode
574818
Title
Scalable, MDP-based planning for multiple, cooperating agents
Author
Redding, J.D. ; Ure, N. Kemal ; How, Jonathan P. ; Vavrina, Matthew A. ; Vian, John
Author_Institution
Aerosp. Controls Lab., MIT, Cambridge, MA, USA
fYear
2012
fDate
27-29 June 2012
Firstpage
6011
Lastpage
6016
Abstract
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
Keywords
Markov processes; approximation theory; multi-agent systems; path planning; MDP-based planning; Markov decision process; approximation algorithm; cooperating agent; decentralized approach; stochastic multiagent planning; Actuators; Approximation methods; Fuels; Joints; Observability; Planning; Surveillance;
fLanguage
English
Publisher
ieee
Conference_Titel
American Control Conference (ACC), 2012
Conference_Location
Montreal, QC
ISSN
0743-1619
Print_ISBN
978-1-4577-1095-7
Electronic_ISBN
0743-1619
Type
conf
DOI
10.1109/ACC.2012.6315482
Filename
6315482
Link To Document