Title :
Breaking the Speed and Scalability Barriers for Graph Exploration on Distributed-Memory Machines
Author :
Checconi, Fabio ; Petrini, Fabrizio ; Willcock, J. ; Lumsdaine, A. ; Choudhury, Amitavo Roy ; Sabharwal, Yogish
Author_Institution :
IBM TJ Watson, Yorktown Heights, NY, USA
Abstract :
In this paper, we describe the challenges involved in designing a family of highly-efficient Breadth-First Search (BFS) algorithms and in optimizing these algorithms on the latest two generations of Blue Gene machines, Blue Gene/P and Blue Gene/Q. With our recent winning Graph 500 submissions in November 2010, June 2011, and November 2011, we have achieved unprecedented scalability results in both space and size. On Blue Gene/P, we have been able to parallelize a scale 38 problem with 2^38 vertices and 2^42 edges on 131,072 processing cores. Using only four racks of an experimental configuration of Blue Gene/Q, we have achieved a processing rate of 254 billion edges per second on 65,536 processing cores. This paper describes the algorithmic design and the main classes of optimizations that we have used to achieve these results.
Keywords :
distributed memory systems; graph theory; search problems; BFS; blue gene machines; blue gene/p; blue gene/q; breadth first search algorithms; distributed memory machines; graph exploration; Algorithm design and analysis; Benchmark testing; Communities; Computer architecture; Prediction algorithms; Program processors; Software algorithms;
Conference_Title :
High Performance Computing, Networking, Storage and Analysis (SC), 2012 International Conference for
Conference_Location :
Salt Lake City, UT
Print_ISBN :
978-1-4673-0805-2