• DocumentCode
    1120101
  • Title

    Resource-Aware Distributed Scheduling Strategies for Large-Scale Computational Cluster/Grid Systems

  • Author

    Viswanathan, Sivakumar ; Veeravalli, Bharadwaj ; Robertazzi, Thomas G.

  • Author_Institution
    Nat. Univ. of Singapore, Singapore
  • Volume
    18
  • Issue
    10
  • fYear
    2007
  • Firstpage
    1450
  • Lastpage
    1461
  • Abstract
    In this paper, we propose distributed algorithms referred to as resource-aware dynamic incremental scheduling (RADIS) strategies. Our strategies are specifically designed to handle large volumes of computationally intensive arbitrarily divisible loads submitted for processing at cluster/grid systems involving multiple sources and sinks (processing nodes). We consider a real-life scenario, wherein the buffer space (memory) available at the sinks (required for holding and processing the loads) varies over time, and the loads have deadlines and propose efficient "pull-based" scheduling strategies with an admission control policy that ensures that the admitted loads are processed, satisfying their deadline requirements. The design of our proposed strategies adopts the divisible load paradigm, referred to as the divisible load theory (DLT), which is shown to be efficient in handling large volume loads. We demonstrate detailed workings of the proposed algorithms via a simulation study by using real-life parameters obtained from a major physics experiment.
  • Keywords
    buffer storage; distributed algorithms; grid computing; resource allocation; scheduling; admission control; buffer space; computational cluster system; distributed algorithms; divisible load theory; grid systems; pull-based scheduling strategies; resource-aware dynamic incremental scheduling; Algorithm design and analysis; Clustering algorithms; Computer networks; Distributed computing; Dynamic scheduling; Grid computing; Large-scale systems; Physics computing; Processor scheduling; Resource management; Cluster computing; Divisible loads; Grid computing; buffer constraints; deadlines; processing time;
  • fLanguage
    English
  • Journal_Title
    Parallel and Distributed Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/TPDS.2007.1073
  • Filename
    4302731