DocumentCode :
580128
Title :
Early performance evaluation of a "Nehalem" cluster using scientific and engineering applications
Author :
Saini, Shrikant ; Naraikin, A. ; Biswas, Rubel ; Barkai, D. ; Sandstrom, T.
Author_Institution :
NASA Ames Res., Moffett Field, CA, USA
fYear :
2009
fDate :
14-20 Nov. 2009
Firstpage :
1
Lastpage :
12
Abstract :
In this paper, we present an early performance evaluation of a 624-core cluster based on the Intel® Xeon® Processor 5560 (code named "Nehalem-EP", and referred to as Xeon 5560 in this paper)---the third-generation quad-core architecture from Intel. This is the first processor from Intel with a non-uniform memory access (NUMA) architecture managed by on-chip integrated memory controller. It employs a point-to-point interconnect called the Intel® QuickPath Interconnect (QPI) between processors and to the input/output (I/O) hub. It also introduces to a quad-core architecture both Intel\´s hyper-threading technology (or simultaneous multi-threading, "SMT") and Intel® Turbo Boost Technology ("Turbo mode") that automatically allow processor cores to run faster than the base operating frequency if the processor is operating below rated power, temperature, and current specification limits. It can be engaged with any number of cores or logical processors enabled and active. We critically evaluate these features using the High Performance Computing Challenge (HPCC) benchmarks, NAS Parallel Benchmarks (NPB), and four full-scale scientific applications. We compare and contrast the results of a cluster based on the Xeon 5560 with an SGI® Altix® ICE 8200EX cluster of quad-core Intel® Xeon® 5472 Processor ("Xeon 5472" from here on) and another cluster of Intel® Xeon® 5462 Processor ("Xeon 5462"; the Xeon 5400 Series Processors are previous generation quad-core Intel processors and were code named Harpertown).
Keywords :
benchmark testing; input-output programs; multi-threading; multiprocessing systems; multiprocessor interconnection networks; parallel processing; performance evaluation; HPCC benchmarks; I/O hub; Intel QuickPath Interconnect; Intel Turbo Boost Technology; Intel Xeon Processor 5560; NAS parallel benchmarks; NPB; NUMA architecture; Nehalem cluster; Nehalem-EP; QPI; SGI Altix ICE 8200EX cluster; SMT; Xeon 5560; base operating frequency; current specification limits; engineering application; four full-scale scientific applications; high performance computing challenge benchmarks; hyper-threading technology; input/output hub; logical processors; nonuniform memory access architecture; on-chip integrated memory controller; performance evaluation; point-to-point interconnect; processor cores; quad-core Intel Xeon 5472 processor; simultaneous multithreading; third-generation quad-core architecture; turbo mode;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
High Performance Computing Networking, Storage and Analysis, Proceedings of the Conference on
Conference_Location :
Portland, OR
Type :
conf
DOI :
10.1145/1654059.1654084
Filename :
6375577
Link To Document :
بازگشت