Title :
Exploring the relationship between parallel application run-time and network performance in clusters
Author :
Evans, Jeffrey J. ; Groop, W.D. ; Hood, Cynthia S.
Author_Institution :
Dept. of Electr. & Comput. Eng. Technol., Purdue Univ., West Lafayette, IN, USA
Abstract :
Highly variable parallel application execution time is a persistent issue in cluster computing environments, and can be particularly acute in systems composed of networks of workstations (NOWs). We are looking at this issue in terms of consistency. In particular, we are focusing on network performance. Before we can use techniques from fault management to attain consistency, this paper presents our preliminary analysis of run-time variability from logs and experiments, exposing important issues related to systemic inconsistency in NOW clusters. The characterization of application sensitivity can be used to set network performance goals, thereby defining operational requirements. Network performance depends on the virtual topology imposed by the scheduler´s allocation of nodes and the communication patterns of the set of running applications. Therefore it is important to look at both the network and the cluster´s centralized node mapper (scheduler) as critical subsystems.
Keywords :
network topology; workstation clusters; cluster computing; communication patterns; fault management; run-time variability; virtual topology; Application software; Computer networks; Computer science; Degradation; Intelligent networks; Network topology; Processor scheduling; Resource management; Runtime; Workstations;
Conference_Titel :
Local Computer Networks, 2003. LCN '03. Proceedings. 28th Annual IEEE International Conference on
Print_ISBN :
0-7695-2037-5
DOI :
10.1109/LCN.2003.1243180