DocumentCode
3428255
Title
Breaking the speed and scalability Barriers for Graph exploration on distributed-memory machines
Author
Checconi, Fabio ; Petrini, Fabrizio ; Willcock, J. ; Lumsdaine, A. ; Choudhury, Amitavo Roy ; Sabharwal, Yogish
Author_Institution
IBM TJ Watson, Yorktown Heights, NY, USA
fYear
2012
fDate
10-16 Nov. 2012
Firstpage
1
Lastpage
12
Abstract
In this paper, we describe the challenges involved in designing a family of highly-efficient Breadth-First Search (BFS) algorithms and in optimizing these algorithms on the latest two generations of Blue Gene machines, Blue Gene/P and Blue Gene/Q. With our recent winning Graph 500 submissions in November 2010, June 2011, and November 2011, we have achieved unprecedented scalability results in both space and size. On Blue Gene/P, we have been able to parallelize a scale 38 problem with 238 vertices and 242 edges on 131,072 processing cores. Using only four racks of an experimental configuration of Blue Gene/Q, we have achieved a processing rate of 254 billion edges per second on 65,536 processing cores. This paper describes the algorithmic design and the main classes of optimizations that we have used to achieve these results.
Keywords
distributed memory systems; graph theory; search problems; BFS; blue gene machines; blue gene/p; blue gene/q; breadth first search algorithms; distributed memory machines; graph exploration; Algorithm design and analysis; Benchmark testing; Communities; Computer architecture; Prediction algorithms; Program processors; Software algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location
Salt Lake City, UT
ISSN
2167-4329
Print_ISBN
978-1-4673-0805-2
Type
conf
DOI
10.1109/SC.2012.25
Filename
6468459
Link To Document