DocumentCode
1120101
Title
Resource-Aware Distributed Scheduling Strategies for Large-Scale Computational Cluster/Grid Systems
Author
Viswanathan, Sivakumar ; Veeravalli, Bharadwaj ; Robertazzi, Thomas G.
Author_Institution
Nat. Univ. of Singapore, Singapore
Volume
18
Issue
10
fYear
2007
Firstpage
1450
Lastpage
1461
Abstract
In this paper, we propose distributed algorithms referred to as resource-aware dynamic incremental scheduling (RADIS) strategies. Our strategies are specifically designed to handle large volumes of computationally intensive arbitrarily divisible loads submitted for processing at cluster/grid systems involving multiple sources and sinks (processing nodes). We consider a real-life scenario, wherein the buffer space (memory) available at the sinks (required for holding and processing the loads) varies over time, and the loads have deadlines and propose efficient "pull-based" scheduling strategies with an admission control policy that ensures that the admitted loads are processed, satisfying their deadline requirements. The design of our proposed strategies adopts the divisible load paradigm, referred to as the divisible load theory (DLT), which is shown to be efficient in handling large volume loads. We demonstrate detailed workings of the proposed algorithms via a simulation study by using real-life parameters obtained from a major physics experiment.
Keywords
buffer storage; distributed algorithms; grid computing; resource allocation; scheduling; admission control; buffer space; computational cluster system; distributed algorithms; divisible load theory; grid systems; pull-based scheduling strategies; resource-aware dynamic incremental scheduling; Algorithm design and analysis; Clustering algorithms; Computer networks; Distributed computing; Dynamic scheduling; Grid computing; Large-scale systems; Physics computing; Processor scheduling; Resource management; Cluster computing; Divisible loads; Grid computing; buffer constraints; deadlines; processing time;
fLanguage
English
Journal_Title
Parallel and Distributed Systems, IEEE Transactions on
Publisher
ieee
ISSN
1045-9219
Type
jour
DOI
10.1109/TPDS.2007.1073
Filename
4302731
Link To Document