DocumentCode :
2414020
Title :
Incorporating job migration and network RAM to share cluster memory resources
Author :
Xiao, Li ; Zhang, Xiaodong ; Kubricht, Stefan A.
Author_Institution :
Dept. of Comput. Sci., Coll. of William & Mary, Williamsburg, VA, USA
fYear :
2000
fDate :
2000
Firstpage :
71
Lastpage :
78
Abstract :
Job migrations and network RAM are two approaches for effectively using global memory resources in a workstation cluster, aimed at reducing page faults in each local workstation and improving the overall performance of cluster computing. Using either remote executions or pre-emptive migrations, a load-sharing system is able to migrate a job from a workstation without sufficient memory space to a lightly loaded workstation with a large idle memory space for the migrated job. In a network RAM system, if a job cannot find sufficient memory space for its working sets, it utilizes idle memory space from other workstations in the cluster through remote paging. Conducting trace-driven simulations, we have compared the performance and tradeoffs of the two approaches and their impacts on job execution time and cluster scalability. Job migration-based load-sharing schemes are able to balance executions of jobs in a cluster well, while network RAM is able to satisfy data-intensive jobs which may not be migratable by sharing all the idle memory resources in a cluster. A network RAM cluster of workstations is scalable only if the network is sufficiently fast. We propose an improved load-sharing scheme by combining job migrations with network RAM for cluster computing. This scheme uses remote execution to initially allocate a job to the most lightly loaded workstation and, if necessary, network RAM to provide a larger memory space for the job than would be available otherwise. The improved scheme has the merits of both job migrations and network RAM. Our experiments show its effectiveness and scalability for cluster computing.
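The combined scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Workstation` class, the `place_job` function, the megabyte units, and the definition of "most lightly loaded" as "fewest running jobs, ties broken by most idle memory" are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Workstation:
    # Hypothetical model of a cluster node; field names are illustrative.
    name: str
    idle_memory_mb: int
    running_jobs: int = 0


def place_job(job_memory_mb, cluster):
    """Place one job; return (host, local_mb, borrowed, unmet_mb)."""
    # Step 1 (remote execution): choose the most lightly loaded node.
    # Assumption: "lightly loaded" = fewest jobs, then most idle memory.
    host = min(cluster, key=lambda w: (w.running_jobs, -w.idle_memory_mb))
    local_mb = min(job_memory_mb, host.idle_memory_mb)
    host.idle_memory_mb -= local_mb
    host.running_jobs += 1

    # Step 2 (network RAM): cover any shortfall with idle memory on
    # other nodes, largest donors first.
    shortfall = job_memory_mb - local_mb
    borrowed = {}
    donors = sorted(
        (w for w in cluster if w is not host),
        key=lambda w: -w.idle_memory_mb,
    )
    for donor in donors:
        if shortfall == 0:
            break
        take = min(shortfall, donor.idle_memory_mb)
        if take > 0:
            donor.idle_memory_mb -= take
            borrowed[donor.name] = take
            shortfall -= take

    # Whatever remains in `shortfall` could not be served from cluster memory.
    return host.name, local_mb, borrowed, shortfall


cluster = [
    Workstation("a", 64, 2),
    Workstation("b", 128, 0),
    Workstation("c", 256, 1),
]
result = place_job(300, cluster)
print(result)
```

In this example the job lands on node `b`, which has no running jobs, and the memory that `b` cannot supply locally is borrowed from node `c` through the simulated network RAM step.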
Keywords :
distributed programming; paged storage; performance evaluation; random-access storage; resource allocation; shared memory systems; virtual machines; workstation clusters; cluster computing; cluster memory resource sharing; cluster scalability; data-intensive jobs; global memory resources; idle memory space; job allocation; job execution load balancing; job execution time; job migration; lightly loaded workstation; load-sharing system; network RAM; performance; preemptive migrations; remote executions; remote paging; trace-driven simulations; tradeoffs; working sets; workstation cluster; workstation page faults; Computer networks; Computer science; Delay; Educational institutions; Military computing; Random access memory; Read-write memory; Sun; Telecommunication traffic; Workstations;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
High-Performance Distributed Computing, 2000. Proceedings. The Ninth International Symposium on
Conference_Location :
Pittsburgh, PA
ISSN :
1082-8907
Print_ISBN :
0-7695-0783-2
Type :
conf
DOI :
10.1109/HPDC.2000.868636
Filename :
868636