Title :
How to partition a billion-node graph
Author :
Lu Wang ; Yanghua Xiao ; Bin Shao ; Haixun Wang
Author_Institution :
Hong Kong Univ. of Sci. & Technol., Hong Kong, China
fDate :
March 31 2014-April 4 2014
Abstract :
Billion-node graphs pose significant challenges at all levels from storage infrastructures to programming models. It is critical to develop a general purpose platform for graph processing. A distributed memory system is considered a feasible platform supporting online query processing as well as offline graph analytics. In this paper, we study the problem of partitioning a billion-node graph on such a platform, an important consideration because it has direct impact on load balancing and communication overhead. It is challenging not just because the graph is large, but because we can no longer assume that the data can be organized in arbitrary ways to maximize the performance of the partitioning algorithm. Instead, the algorithm must adopt the same data and programming model adopted by the system and other applications. In this paper, we propose a multi-level label propagation (MLP) method for graph partitioning. Experimental results show that our solution can partition billion-node graphs within several hours on a distributed memory system consisting of merely several machines, and the quality of the partitions produced by our approach is comparable to state-of-the-art approaches applied on toy-size graphs.
Keywords :
distributed memory systems; graph theory; query processing; resource allocation; MLP method; billion-node graph partitioning; communication overhead; distributed memory system; general purpose platform; graph processing; load balancing; multilevel label propagation method; offline graph analytics; online query processing; toy-size graphs; Algorithm design and analysis; Communities; Educational institutions; Frequency modulation; Partitioning algorithms; Semantics; Social network services;
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
DOI :
10.1109/ICDE.2014.6816682