Title :
Improving the performance of communication-intensive parallel applications executing on clusters
Author :
Qin, Xiao ; Jiang, Hong
Author_Institution :
Dept. of Comput. Sci. & Eng., Nebraska Univ. Lincoln, NE, USA
Abstract :
Summary form only given. Clusters have emerged as a primary and cost-effective infrastructure for parallel applications, including communication-intensive applications that transfer a large amount of data among nodes of a cluster via the interconnection network. Conventional load balancers have been proven effective in increasing the utilization of CPU, memory, and disk I/O resources in a cluster. However, most of the existing load balancing schemes ignore network resources, leaving open the opportunity for significant performance bottleneck to form for communication-intensive parallel applications due to unevenly distributed communication load. To remedy this problem, we propose a communication-aware load balancing technique that is capable of improving the performance of communication-intensive applications by increasing the effective utilization of network resources in clusters. To facilitate the proposed load-balancing scheme, we introduce a behavior model for parallel applications with large requirements of CPU, memory, network, and disk 170 resources. The proposed load-balancing scheme can make full use of this model to quickly and accurately determine the load induced by a variety of parallel applications. Simulation results on executing a diverse set of both synthetic bulk synchronous and real parallel applications on a cluster show that the proposed scheme can significantly improve the performance both in slowdown and turn-around time over three existing schemes by up to 206% (with an average of 74%) and 235% (with an average of 82%), respectively.
Keywords :
message passing; parallel processing; resource allocation; workstation clusters; CPU utilization; cluster infrastructure; communication-aware load balancing technique; communication-intensive parallel applications; data transfer; disk I/O resources; distributed communication load; interconnection network; memory utilization; network nodes; network resources; performance bottleneck; synchronous applications; Application software; Computer science; Data engineering; Load management; Multiprocessor interconnection networks;
Conference_Titel :
Cluster Computing, 2004 IEEE International Conference on
Print_ISBN :
0-7803-8694-9
DOI :
10.1109/CLUSTR.2004.1392658