• DocumentCode
    2222537
  • Title

    Coordination of data movement with computation scheduling on a cluster

  • Author

    Bent, John ; Rotem, Dorm ; Romosan, Alexandru ; Shoshani, Ark

  • Author_Institution
    Lawrence Berkeley Nat. Lab., California Univ., Berkeley, CA, USA
  • fYear
    2005
  • fDate
    38557
  • Firstpage
    25
  • Lastpage
    34
  • Abstract
    We are looking at the problem of scheduling compute tasks on a cluster of servers. These tasks require files that reside on a remote archive, and may also be cached on some subset of the servers. A task can only be run on a server that has the files it requires. This introduces the problem of scheduling data movement in coordination with the scheduling of computation. Our goal is to maximize throughput while minimizing data movement. FIFO scheduling is not efficient in this situation due to its lack of awareness of the data movement required. We looked at two other strategies, called shortest job first and linear programming based optimization, and compared them under various configurations.
  • Keywords
    cache storage; file servers; linear programming; processor scheduling; workstation clusters; computation scheduling; data movement scheduling; linear programming; optimization; server cluster; shortest job first scheduling; Computational modeling; Data analysis; File servers; Job design; Laboratories; Linear programming; Physics; Processor scheduling; Throughput; Whales;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Challenges of Large Applications in Distributed Environments, 2005. CLADE 2005. Proceedings
  • Print_ISBN
    0-7803-9043-1
  • Type

    conf

  • DOI
    10.1109/CLADE.2005.1520896
  • Filename
    1520896