DocumentCode :
2485058
Title :
Early experiences with large-scale Cray XMT systems
Author :
Mizell, David ; Maschhoff, Kristyn
Author_Institution :
Cray Inc., Seattle, WA, USA
fYear :
2009
fDate :
23-29 May 2009
Firstpage :
1
Lastpage :
9
Abstract :
Several 64-processor XMT systems have now been shipped to customers and there have been 128-processor, 256-processor and 512-processor systems tested in Cray´s development lab. We describe some techniques we have used for tuning performance in hopes that applications continued to scale on these larger systems. We discuss how the programmer must work with the XMT compiler to extract maximum parallelism and performance, especially from multiply nested loops, and how the performance tools provide vital information about whether or how the compiler has parallelized loops and where performance bottlenecks may be occurring. We also show data that indicate that the maximum performance of a given application on a given size XMT system is limited by memory or network bandwidth, in a way that is somewhat independent of the number of processors used.
Keywords :
program compilers; program testing; software performance evaluation; 128-processor systems; 256-processor systems; 512-processor systems; 64-processor XMT systems; XMT compiler; large-scale Cray XMT systems; maximum parallelism; memory bandwidth; network bandwidth; tuning performance; Bandwidth; Computer architecture; Large-scale systems; Program processors; Programming profession; Prototypes; Switches; System testing; Throughput; Yarn; multithreading; performance tuning; scaling;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
Conference_Location :
Rome
ISSN :
1530-2075
Print_ISBN :
978-1-4244-3751-1
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2009.5161108
Filename :
5161108
Link To Document :
بازگشت