Title :
Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine
Author :
Petrini, Fabrizio ; Fossum, Gordon ; Fernandez, Juan ; Varbanescu, Ana Lucia ; Kistler, Mike ; Perrone, Michael
Author_Institution :
Pacific Northwest Nat. Lab., Richland, WA
Abstract :
The Cell Broadband Engine (BE) processor provides the potential to achieve an impressive level of performance for scientific applications. This level of performance can be reached by exploiting several dimensions of parallelism, such as thread-level parallelism using several synergistic processing elements, data streaming parallelism, vector parallelism in the form of 128-bit SIMD operations, and pipeline parallelism by issuing multiple instructions in the same clock cycle. In our exploration to achieve the optimum level of performance for Sweep3D, we have enjoyed many pleasant surprises, such as a very high floating point performance, reaching 64% of the theoretical peak in double precision, and an over all performance speedup ranging from 4.5 times when compared with "heavy iron" processors, up to over 20 times with conventional processors.
Keywords :
computer architecture; natural sciences computing; parallel processing; pipeline processing; 128-bit SIMD operations; Cell Broadband Engine processor; Sweep3D optimization; multicore surprises; pipeline parallelism; scientific applications; Buildings; Computer architecture; Concurrent computing; Energy consumption; Engines; High performance computing; Laboratories; Multicore processing; Parallel processing; Yarn;
Conference_Titel :
Parallel and Distributed Processing Symposium, 2007. IPDPS 2007. IEEE International
Conference_Location :
Long Beach, CA
Print_ISBN :
1-4244-0910-1
Electronic_ISBN :
1-4244-0910-1
DOI :
10.1109/IPDPS.2007.370252