DocumentCode
2111240
Title
Lagrangian Relaxation for Large-Scale Multi-agent Planning
Author
Gordon, Geoffrey J. ; Varakantham, Pradeep ; Yeoh, Wai Siang ; Hoong Chuin Lau ; Aravamudhan, A.S. ; Shih-Fen Cheng
Author_Institution
Machine Learning Dept., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume
2
fYear
2012
fDate
4-7 Dec. 2012
Firstpage
494
Lastpage
501
Abstract
Multi-agent planning is a well-studied problem with various applications including disaster rescue, urban transportation and logistics, both for autonomous agents and for decision support to humans. Due to computational constraints, existing research typically focuses on one of two scenarios: unstructured domains with many agents where we are content with heuristic solutions, or domains with small numbers of agents or special structure where we can provide provably near-optimal solutions. By contrast, in this paper, we focus on providing provably near-optimal solutions for domains with large numbers of agents, by exploiting a common domain-general property: if individual agents each have limited influence on the overall solution quality, then we can take advantage of randomization and the resulting statistical concentration to show that each agent can safely plan based only on the average behavior of the other agents. To that end, we make two key contributions: (a) an algorithm, based on Lagrangian relaxation and randomized rounding, for solving multi-agent planning problems represented as large mixed-integer programs, (b) a proof of convergence of our algorithm to a near-optimal solution. We demonstrate the scalability of our approach with a large-scale illustrative theme park crowd management problem.
Keywords
integer programming; multi-agent systems; planning (artificial intelligence); Lagrangian relaxation; agent behavior; autonomous agent; computational constraint; convergence proof; decision support; domain-general property; heuristic solution; mixed-integer program; multi-agent planning; near-optimal solution; randomization; randomized rounding; statistical concentration; theme park crowd management problem; Gradient Descent; Lagrangian Relaxation; Multi-Agent Systems;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location
Macau
Print_ISBN
978-1-4673-6057-9
Type
conf
DOI
10.1109/WI-IAT.2012.252
Filename
6511613
Link To Document