DocumentCode :
574818
Title :
Scalable, MDP-based planning for multiple, cooperating agents
Author :
Redding, J.D. ; Ure, N. Kemal ; How, Jonathan P. ; Vavrina, Matthew A. ; Vian, John
Author_Institution :
Aerosp. Controls Lab., MIT, Cambridge, MA, USA
fYear :
2012
fDate :
27-29 June 2012
Firstpage :
6011
Lastpage :
6016
Abstract :
This paper introduces an approximation algorithm for stochastic multi-agent planning based on Markov decision processes (MDPs). Specifically, we focus on a decentralized approach for planning the actions of a team of cooperating agents with uncertainties in fuel consumption and health-related models. The core idea behind the algorithm presented in this paper is to allow each agent to approximate the representation of its teammates. Each agent therefore maintains its own planner that fully enumerates its local states and actions while approximating those of its teammates. In prior work, the authors approximated each teammate individually, which resulted in a large reduction of the planning space, but remained exponential (in n - 1 rather than in n, where n is the number of agents) in computational scalability. This paper extends the approach and presents a new approximation that aggregates all teammates into a single, abstracted entity. Under the persistent search & track mission scenario with 3 agents, we show that while resulting performance is decreased nearly 20% compared with the centralized optimal solution, the problem size becomes linear in n, a very attractive feature when planning online for large multi-agent teams.
Keywords :
Markov processes; approximation theory; multi-agent systems; path planning; MDP-based planning; Markov decision process; approximation algorithm; cooperating agent; decentralized approach; stochastic multiagent planning; Actuators; Approximation methods; Fuels; Joints; Observability; Planning; Surveillance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
American Control Conference (ACC), 2012
Conference_Location :
Montreal, QC
ISSN :
0743-1619
Print_ISBN :
978-1-4577-1095-7
Electronic_ISBN :
0743-1619
Type :
conf
DOI :
10.1109/ACC.2012.6315482
Filename :
6315482
Link To Document :
بازگشت