DocumentCode
3222491
Title
Fast PGAS Implementation of Distributed Graph Algorithms
Author
Cong, Guojing ; Almasi, George ; Saraswat, Vijay
Author_Institution
IBM TJ Watson Res. Center, Yorktown Heights, NY, USA
fYear
2010
fDate
13-19 Nov. 2010
Firstpage
1
Lastpage
11
Abstract
Due to the memory intensive workload and the erratic access pattern, irregular graph algorithms are notoriously hard to implement and optimize for high performance on distributed-memory systems. Although the PGAS paradigm proposed recently improves ease of programming, no high performance PGAS implementation of large-scale graph analysis is known. We present the first fast PGAS implementation of graph algorithms for the connected components and minimum spanning tree problems. By improving memory access locality, compared with the naive implementation, our implementation exhibits much better communication efficiency and cache performance on a cluster of SMPs. With additional algorithmic and PGASspecific optimizations, our implementation achieves significant speedups over both the best sequential implementation and the best single-node SMP implementation for large, sparse graphs with more than a billion edges.
Keywords
distributed memory systems; parallel algorithms; trees (mathematics); SMP cluster; distributed graph algorithms; distributed-memory systems; erratic access pattern; fast PGAS; irregular graph algorithms; large-scale graph analysis; minimum spanning tree problems; parallel graph algorithms; Algorithm design and analysis; Clustering algorithms; Electronics packaging; Instruction sets; Optimization; Phase change random access memory;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2010 International Conference for
Conference_Location
New Orleans, LA
Print_ISBN
978-1-4244-7557-5
Electronic_ISBN
978-1-4244-7558-2
Type
conf
DOI
10.1109/SC.2010.26
Filename
5644844
Link To Document