DocumentCode :
2092425
Title :
Performance Analysis of CFD Application Cart3D Using MPInside and Performance Monitor Unit Data on Nehalem and Westmere Based Supercomputers
Author :
Saini, Subhash ; Mehrotra, Piyush ; Taylor, Kenichi ; Aftosmis, Michael ; Biswas, Rupak
Author_Institution :
NASA Adv. Supercomput. Div., NASA Ames Res. Center, Moffett Field, CA, USA
fYear :
2011
fDate :
2-4 Sept. 2011
Firstpage :
331
Lastpage :
338
Abstract :
Cart3D is a computational fluid dynamics (CFD) application aimed at conceptual and preliminary design of aerospace vehicles with complex geometries. It is widely used by design engineers at NASA, the Department of Defense, and aerospace companies in the USA. We present a detailed performance analysis of Cart3D using two tools: SGI MPInside, and op_scope, which collects hardware counter data from the Intel Performance Monitoring Unit (PMU), on supercomputers based on the Nehalem micro-architecture. Using these tools, we have performed dynamic profiling of Cart3D (compute time, communication time, and I/O time), along with dynamic profiling of MPI functions (MPI_Sendrecv, MPI_Bcast, MPI_Isend, MPI_Irecv, MPI_Allreduce, MPI_Barrier, etc.) with respect to the message size of each rank and the time consumed by each function. MPI communication is further analyzed by studying the performance of the MPI functions used in this application as a function of message size and number of cores. We have also used these tools to study processor efficiency (as a measure of effective utilization), the efficiency of the floating-point units, the percentage of vectorization, and the percentage of data coming from L2 cache, L3 cache, and main memory. This study was performed on two computing sub-systems based on quad-core Nehalem-EP and hex-core Westmere-EP processors that are part of Pleiades, an SGI Altix ICE system at NASA Ames Research Center.
Keywords :
aerospace; aircraft; computational fluid dynamics; computer architecture; defence industry; mainframes; message passing; multiprocessing systems; parallel machines; CFD application; Cart3D; Department of Defense; Intel Performance Monitoring Unit; L2 cache; L3 cache; MPI communication; MPI functions; MPI_Allreduce; MPI_Barrier; MPI_Bcast; MPI_Irecv; MPI_Isend; MPI_Sendrecv; NASA Ames Research Center; Nehalem based supercomputers; Nehalem micro-architecture; PMU; Pleiades; SGI Altix ICE; SGI MPInside; USA; Westmere based supercomputers; aerospace companies; aerospace vehicles; complex geometries; computational fluid dynamics application; compute time communication time IO time; design engineers; dynamic profiling; floating-point units; hardware counter data; hex-core Westmere-EP processors; main memory; op_scope; performance analysis; performance monitor unit data; quadcore Nehalem-EP processors; Benchmark testing; Computational fluid dynamics; Hardware; Ice; Monitoring; NASA; Phasor measurement units; Benchmarking; CFD application; Hyper-Threading; Intel Nehalem micro-architecture; Performance Evaluation; Processor efficiency; SGI Altix ICE; Simultaneous Multi Threading (SMT);
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
High Performance Computing and Communications (HPCC), 2011 IEEE 13th International Conference on
Conference_Location :
Banff, AB
Print_ISBN :
978-1-4577-1564-8
Electronic_ISBN :
978-0-7695-4538-7
Type :
conf
DOI :
10.1109/HPCC.2011.50
Filename :
6063009