Title :
A technique for adaptation to available resources on clusters independent of synchronization methods used
Author :
Rencuzogullari, Umit ; Dwarkadas, Sandhya
Author_Institution :
Dept. of Comput. Sci., Rochester Univ., NY, USA
Abstract :
Clusters of workstations (COW) offer high performance relative to their cost. Generally these clusters operate as autonomous systems running independent copies of the operating system, where access to machines is not controlled and all users enjoy the same access privileges. While these features are desirable and reduce operating costs, they create adverse effects on parallel applications running on these clusters. Load imbalances are common for parallel applications on COWs due to: 1) variable amount of load on nodes caused by an inherent lack of parallelism, 2) variable resource availability on nodes, and 3) independent scheduling decisions made by the independent schedulers on each node. Our earlier study has shown that an approach combining static program analysis, dynamic load balancing, and scheduler cooperation is effective in countering the adverse effects mentioned above. In our current study, we investigate the scalability of our approach as the number of processors is increased. We further relax the requirement of global synchronization, avoiding the need to use barriers and allowing the use of any other synchronization primitives while still achieving dynamic load balancing. The use of alternative synchronization primitives avoids the inherent vulnerability of barriers to load imbalance. It also allows load balancing to take place at any point in the course of execution, rather than only at a synchronization point, potentially reducing the time the application runs imbalanced. Moreover, load readjustment decisions are made in a distributed fashion, thus preventing any need for processes to globally synchronize in order to redistribute load.
Keywords :
distributed shared memory systems; parallel programming; processor scheduling; program diagnostics; resource allocation; synchronisation; workstation clusters; access privileges; autonomous systems; available resource adaptation; dynamic load balancing; independent operating system copies; independent scheduling decisions; load balancing; load imbalances; load readjustment decisions; nodes; parallel applications; scalability; scheduler cooperation; static program analysis; synchronization methods; synchronization primitives; variable load; variable resource availability; workstation clusters; Availability; Control systems; Costs; Cows; Dynamic scheduling; Load management; Operating systems; Processor scheduling; Scalability; Workstations;
Conference_Titel :
Parallel Processing, 2002. Proceedings. International Conference on
Print_ISBN :
0-7695-1677-7
DOI :
10.1109/ICPP.2002.1040895