DocumentCode :
3025084
Title :
A scalable method for predicting network performance in heterogeneous clusters
Author :
Katramatos, Dimitrios ; Chapin, Steve J.
Author_Institution :
Dept. of Comput. Sci., Virginia Univ., Charlottesville, VA, USA
fYear :
2005
fDate :
7-9 Dec. 2005
Abstract :
An important requirement for the effective scheduling of parallel applications on large heterogeneous clusters is a current view of system resource availability. Maintaining such a view is a time-consuming problem, potentially O(N^2). Although CPU availability is relatively easy to monitor, interconnecting network bandwidth varies not only with network topology, but also with message size and even with respect to the load of the communicating nodes. This paper describes a method for predicting a cluster's network performance for the purpose of scheduling parallel applications. The method generates a cluster-specific network model which can predict the latency of communications between any pair of nodes in linear time and under any computational and/or communication load conditions. The paper also presents the models generated for the Centurion cluster at the University of Virginia and the Orange Grove cluster at Syracuse University. A study of the prediction accuracy of the method under various load conditions by comparison to experimental measurements indicates an average prediction error of approximately 5% with the maximum encountered prediction error of less than 9%.
Keywords :
computational complexity; parallel processing; performance evaluation; scheduling; workstation clusters; cluster network performance; cluster-specific network model; communication load condition; heterogeneous clusters; parallel application; scheduling; system resource availability; time consuming problem; Availability; Bandwidth; Computer networks; Computer science; Condition monitoring; Delay; Intelligent networks; Network topology; Processor scheduling; Programming environments;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel Architectures, Algorithms and Networks, 2005. ISPAN 2005. Proceedings. 8th International Symposium on
ISSN :
1087-4089
Print_ISBN :
0-7695-2509-1
Type :
conf
DOI :
10.1109/ISPAN.2005.11
Filename :
1575840