DocumentCode :
1998784
Title :
Distributed Memory Breadth-First Search Revisited: Enabling Bottom-Up Search
Author :
Beamer, Scott ; Buluc, Aydin ; Asanovic, Krste ; Patterson, Dean
Author_Institution :
EECS Dept., Univ. of California, Berkeley, Berkeley, CA, USA
fYear :
2013
fDate :
20-24 May 2013
Firstpage :
1618
Lastpage :
1627
Abstract :
Breadth-first search (BFS) is a fundamental graph primitive frequently used as a building block for many complex graph algorithms. In the worst case, the complexity of BFS is linear in the number of edges and vertices, and the conventional top-down approach always takes as much time as the worst case. A recently discovered bottom-up approach manages to cut down the complexity all the way to the number of vertices in the best case, which is typically at least an order of magnitude less than the number of edges. The bottom-up approach is not always advantageous, so it is combined with the top-down approach to make the direction-optimizing algorithm which adaptively switches from top-down to bottom-up as the frontier expands. We present a scalable distributed-memory parallelization of this challenging algorithm and show up to an order of magnitude speedups compared to an earlier purely top-down code. Our approach also uses a 2D decomposition of the graph that has previously been shown to be superior to a 1D decomposition. Using the default parameters of the Graph500 benchmark, our new algorithm achieves a performance rate of over 240 billion edges per second on 115 thousand cores of a Cray XE6, which makes it over 7× faster than a conventional top-down algorithm using the same set of optimizations and data distribution.
Keywords :
distributed memory systems; parallel algorithms; tree searching; 1D decomposition; 2D graph decomposition; BFS; Cray XE6; Graph500 benchmark; bottom-up search; complex graph algorithms; data distribution; distributed memory breadth-first search; graph primitive; scalable distributed-memory parallelization; Algorithm design and analysis; Complexity theory; Parallel processing; Partitioning algorithms; Sparse matrices; Syntactics; Vectors; breadth-first search; direction-optimizing; distributed;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), 2013 IEEE 27th International
Conference_Location :
Cambridge, MA
Print_ISBN :
978-0-7695-4979-8
Type :
conf
DOI :
10.1109/IPDPSW.2013.159
Filename :
6651058
Link To Document :
بازگشت