DocumentCode
249486
Title
LightGraph: Lighten Communication in Distributed Graph-Parallel Processing
Author
Yue Zhao ; Yoshigoe, Kenji ; Mengjun Xie ; Suijian Zhou ; Seker, Remzi ; Jiang Bian
Author_Institution
Dept. of Comput. Sci., Univ. of Arkansas at Little Rock, Little Rock, AR, USA
fYear
2014
fDate
June 27 2014-July 2 2014
Firstpage
717
Lastpage
724
Abstract
A number of graph-structured computing abstractions have been proposed to address the needs of solving complex and large-scale graph algorithms. Distributed Graphlab and its successor, PowerGraph, are two such frameworks that have demonstrated excellent performance with high scalability and fault tolerance. However, excessive communication and state sharing among nodes in these frameworks not only reduce network efficiency but may also cause a decrease in runtime performance. In this paper, we first propose a mechanism that identifies and eliminates the avoidable communication during synchronization in existing distributed graph structured computing abstractions. We have implemented our method on PowerGraph and created LightGraph to reduce communication overhead in distributed graph-parallel computation systems. Furthermore, to minimize the required intra-graph synchronizations for PageRank-like applications, LightGraph also employs an edge direction-aware graph partitioning strategy, which optimally isolates the outgoing edges from the incoming edges of a vertex when creating and distributing replicas among different machines. We have conducted extensive experiments using real-world data, and our results verified the effectiveness of LightGraph. For example, when compared with the best existing graph placement method in PowerGraph, LightGraph can not only reduce up to 27.6% of synchronizing communication overhead for intra-graph synchronizations but also cut up to 17.1% runtime for PageRank.
Keywords
graph theory; parallel processing; LightGraph; PageRank-like applications; PowerGraph; complex graph algorithms; distributed graph structured computing abstractions; distributed graph-parallel processing; edge direction-aware graph partitioning strategy; graph placement method; intragraph synchronizations; large-scale graph algorithms; lighten communication; Computational modeling; Distributed databases; Educational institutions; Mirrors; Partitioning algorithms; Runtime; Synchronization; Big-data; Communication Overhead; Graph-parallel Computing;
fLanguage
English
Publisher
ieee
Conference_Titel
Big Data (BigData Congress), 2014 IEEE International Congress on
Conference_Location
Anchorage, AK
Print_ISBN
978-1-4799-5056-0
Type
conf
DOI
10.1109/BigData.Congress.2014.106
Filename
6906849
Link To Document