Title :
Cluster performance and the implications for distributed, heterogeneous grid performance
Author :
Lee, Craig ; DeMatteis, Cheryl ; Stepanek, James ; Wang, Johnson
Author_Institution :
Dept. of Comput. Syst. Res., Aerosp. Corp., El Segundo, CA, USA
Abstract :
Examines the issues surrounding efficient execution in heterogeneous grid environments. The performances of a Linux cluster and a parallel supercomputer are initially compared using both benchmarks and an application. With an understanding of how benchmark and application performance is affected by processor and interconnect speed, a comparison is made with the bandwidth and latencies available in a tested grid. Of significant concern is the fact that the available communication bandwidth and latencies have a dynamic range of 3 to 4 orders of magnitude, while processor speeds have a range of about one-half order of magnitude. Also, while both processor speed and network bandwidth are increasing very rapidly, simple propagation delay will become more significant in the network latencies seen by many grid applications. That is to say, the pipes in a grid will be getting fatter but not commensurately shorter. How are we to effectively utilize such an infrastructure? Clearly, an attractive approach is to require sufficient concurrency in the application such that a coarse-grain, data-driven model of execution can be used to hide latencies while hopefully keeping context-switching overheads low. If the “spatial component” of an application is understood, then runtime systems could also apply established techniques like caching, compression, estimation and speculative pre-fetching. Ideally, this low-level performance management should be encapsulated in an easy-to-use abstraction
Keywords :
multiprocessing systems; performance evaluation; wide area networks; workstation clusters; Linux cluster performance; application concurrency; application performance; benchmarks; caching; coarse-grained data-driven execution model; communication bandwidth; compression; context-switching overheads; distributed heterogeneous grid performance; dynamic range; easy-to-use abstraction; estimation; interconnect speed; low-level performance management; network bandwidth; network latencies; parallel supercomputer; processor speed; propagation delay; runtime systems; spatial component; speculative prefetching; Bandwidth; Benchmark testing; Clocks; Concurrent computing; Context modeling; Costs; Ear; Linux; Propagation delay; Supercomputers;
Conference_Titel :
Heterogeneous Computing Workshop, 2000. (HCW 2000) Proceedings. 9th
Conference_Location :
Cancun
Print_ISBN :
0-7695-0556-2
DOI :
10.1109/HCW.2000.843749