DocumentCode
2222537
Title
Coordination of data movement with computation scheduling on a cluster
Author
Bent, John ; Rotem, Dorm ; Romosan, Alexandru ; Shoshani, Ark
Author_Institution
Lawrence Berkeley Nat. Lab., California Univ., Berkeley, CA, USA
fYear
2005
fDate
38557
Firstpage
25
Lastpage
34
Abstract
We are looking at the problem of scheduling compute tasks on a cluster of servers. These tasks require files that reside on a remote archive, and may also be cached on some subset of the servers. A task can only be run on a server that has the files it requires. This introduces the problem of scheduling data movement in coordination with the scheduling of computation. Our goal is to maximize throughput while minimizing data movement. FIFO scheduling is not efficient in this situation due to its lack of awareness of the data movement required. We looked at two other strategies, called shortest job first and linear programming based optimization, and compared them under various configurations.
Keywords
cache storage; file servers; linear programming; processor scheduling; workstation clusters; computation scheduling; data movement scheduling; linear programming; optimization; server cluster; shortest job first scheduling; Computational modeling; Data analysis; File servers; Job design; Laboratories; Linear programming; Physics; Processor scheduling; Throughput; Whales;
fLanguage
English
Publisher
ieee
Conference_Titel
Challenges of Large Applications in Distributed Environments, 2005. CLADE 2005. Proceedings
Print_ISBN
0-7803-9043-1
Type
conf
DOI
10.1109/CLADE.2005.1520896
Filename
1520896
Link To Document