DocumentCode
1660111
Title
Dynamic load balancing of an iterative eigensolver on networks of heterogeneous clusters
Author
McCombs, James R. ; Mills, Richard Tran ; Stathopoulos, Andreas
Author_Institution
Dept. of Comput. Sci., Coll. of William & Mary, Williamsburg, VA, USA
fYear
2003
Abstract
Clusters of homogeneous workstations built around fast networks have become popular means of solving scientific problems, and users often have access to several such clusters. Harnessing the collective power of these clusters to solve a single, challenging problem is desirable, but is often impeded by large inter-cluster network latencies and heterogeneity of different clusters. The complexity of these environments requires commensurate advances in parallel algorithm design. We support this thesis by utilizing two techniques: 1) multigrain, a novel algorithmic technique that induces coarse granularity to parallel iterative methods, providing tolerance for large communication latencies, and 2) an application-level load balancing technique applicable to a specific but important class of iterative methods. We implement both algorithmic techniques on the popular Jacobi-Davidson eigenvalue iterative solver. Our experiments on a cluster environment show that the combination of the two techniques enables effective use of heterogeneous, possibly distributed resources, that cannot be achieved by traditional implementations of the method.
Keywords
eigenvalues and eigenfunctions; iterative methods; parallel algorithms; resource allocation; workstation clusters; Jacobi-Davidson eigenvalue iterative solver; application-level load balancing; coarse granularity; communication latency tolerance; distributed resources; dynamic load balancing; heterogeneous resources; iterative eigensolver; multigrain technique; networks of heterogeneous clusters; parallel algorithm design; parallel iterative methods; scientific problems; Algorithm design and analysis; Clustering algorithms; Delay; Impedance; Iterative algorithms; Iterative methods; Jacobian matrices; Load management; Parallel algorithms; Workstations;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2003. Proceedings. International
ISSN
1530-2075
Print_ISBN
0-7695-1926-1
Type
conf
DOI
10.1109/IPDPS.2003.1213126
Filename
1213126
Link To Document