Title :
Coordination of data movement with computation scheduling on a cluster
Author :
Bent, John ; Rotem, Doron ; Romosan, Alexandru ; Shoshani, Arie
Author_Institution :
Lawrence Berkeley Nat. Lab., California Univ., Berkeley, CA, USA
Abstract :
We study the problem of scheduling compute tasks on a cluster of servers. Each task requires files that reside on a remote archive and may also be cached on a subset of the servers. A task can run only on a server that holds all the files it requires, so data movement must be scheduled in coordination with computation. Our goal is to maximize throughput while minimizing data movement. FIFO scheduling is inefficient in this setting because it is unaware of the data movement each task requires. We examine two alternative strategies, shortest job first and linear-programming-based optimization, and compare them under various configurations.
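The effect described in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' implementation: the `Task` class, the `simulate` function, the unit `transfer_time`, the unbounded per-server caches, and the reading of "shortest job first" as "smallest transfer-plus-compute cost on the next idle server" are all assumptions made for illustration. The linear programming strategy is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    # Hypothetical task model: a name, the files it needs, and its compute time.
    name: str
    files: frozenset
    compute_time: float


def simulate(tasks, caches, policy, transfer_time=1.0):
    """Run all tasks; return (makespan, number of files fetched from the archive)."""
    caches = [set(c) for c in caches]   # copy so the caller's input is not mutated
    free_at = [0.0] * len(caches)       # time at which each server becomes idle
    pending = list(tasks)               # tasks in arrival order
    moved = 0
    while pending:
        # Always schedule onto the server that becomes idle first.
        server = min(range(len(caches)), key=free_at.__getitem__)

        def cost(task):
            # Assumed cost: fetch every missing file, then compute.
            missing = task.files - caches[server]
            return len(missing) * transfer_time + task.compute_time

        if policy == "fifo":
            task = pending[0]               # ignores where the files are cached
        elif policy == "sjf":
            task = min(pending, key=cost)   # cheapest task for this server
        else:
            raise ValueError(f"unknown policy: {policy}")

        pending.remove(task)
        moved += len(task.files - caches[server])
        free_at[server] += cost(task)
        caches[server] |= task.files        # fetched files stay cached (assumption)
    return max(free_at), moved


# Two servers, each caching one file; tasks arrive in an order that
# mismatches the caches.
caches = [{"a"}, {"b"}]
tasks = [
    Task("t1", frozenset({"b"}), 1),
    Task("t2", frozenset({"a"}), 1),
    Task("t3", frozenset({"b"}), 1),
    Task("t4", frozenset({"a"}), 1),
]
fifo_result = simulate(tasks, caches, "fifo")
sjf_result = simulate(tasks, caches, "sjf")
```

In this toy instance, arrival-order scheduling fetches two files from the archive and finishes at time 3, while the data-aware ordering fetches none and finishes at time 2. Real workloads would also need bounded caches and eviction, which this sketch leaves out.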
Keywords :
cache storage; file servers; linear programming; processor scheduling; workstation clusters; computation scheduling; data movement scheduling; linear programming; optimization; server cluster; shortest job first scheduling; Computational modeling; Data analysis; File servers; Job design; Laboratories; Linear programming; Physics; Processor scheduling; Throughput; Whales;
Conference_Title :
Challenges of Large Applications in Distributed Environments, 2005. CLADE 2005. Proceedings
Print_ISBN :
0-7803-9043-1
DOI :
10.1109/CLADE.2005.1520896