• DocumentCode
    249486
  • Title

    LightGraph: Lighten Communication in Distributed Graph-Parallel Processing

  • Author

    Yue Zhao ; Yoshigoe, Kenji ; Mengjun Xie ; Suijian Zhou ; Seker, Remzi ; Jiang Bian

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Arkansas at Little Rock, Little Rock, AR, USA
  • fYear
    2014
  • fDate
    June 27 2014-July 2 2014
  • Firstpage
    717
  • Lastpage
    724
  • Abstract
    A number of graph-structured computing abstractions have been proposed to address the needs of solving complex and large-scale graph algorithms. Distributed Graphlab and its successor, PowerGraph, are two such frameworks that have demonstrated excellent performance with high scalability and fault tolerance. However, excessive communication and state sharing among nodes in these frameworks not only reduce network efficiency but may also cause a decrease in runtime performance. In this paper, we first propose a mechanism that identifies and eliminates the avoidable communication during synchronization in existing distributed graph structured computing abstractions. We have implemented our method on PowerGraph and created LightGraph to reduce communication overhead in distributed graph-parallel computation systems. Furthermore, to minimize the required intra-graph synchronizations for PageRank-like applications, LightGraph also employs an edge direction-aware graph partitioning strategy, which optimally isolates the outgoing edges from the incoming edges of a vertex when creating and distributing replicas among different machines. We have conducted extensive experiments using real-world data, and our results verified the effectiveness of LightGraph. For example, when compared with the best existing graph placement method in PowerGraph, LightGraph can not only reduce up to 27.6% of synchronizing communication overhead for intra-graph synchronizations but also cut up to 17.1% runtime for PageRank.
  • Keywords
    graph theory; parallel processing; LightGraph; PageRank-like applications; PowerGraph; complex graph algorithms; distributed graph structured computing abstractions; distributed graph-parallel processing; edge direction-aware graph partitioning strategy; graph placement method; intragraph synchronizations; large-scale graph algorithms; lighten communication; Computational modeling; Distributed databases; Educational institutions; Mirrors; Partitioning algorithms; Runtime; Synchronization; Big-data; Communication Overhead; Graph-parallel Computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Big Data (BigData Congress), 2014 IEEE International Congress on
  • Conference_Location
    Anchorage, AK
  • Print_ISBN
    978-1-4799-5056-0
  • Type

    conf

  • DOI
    10.1109/BigData.Congress.2014.106
  • Filename
    6906849