DocumentCode :
2236000
Title :
Resource Availability Prediction in Fine-Grained Cycle Sharing Systems
Author :
Ren, Xiaojuan ; Lee, Seyong ; Eigenmann, Rudolf ; Bagchi, Saurabh
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN
fYear :
2006
fDate :
0-0 0
Firstpage :
93
Lastpage :
104
Abstract :
Fine-grained cycle sharing (FGCS) systems aim to utilize the large amount of computational resources available on the Internet. In FGCS, host computers allow guest jobs to use CPU cycles as long as the jobs do not significantly impact the host's local users. Such resources are generally provided voluntarily, and their availability fluctuates highly, so guest jobs may fail because of unexpected resource unavailability. Providing fault tolerance to guest jobs without adding significant computational overhead requires predicting future resource availability. This paper presents a method for resource availability prediction in FGCS systems. It applies a semi-Markov process and is based on a novel resource availability model that combines generic hardware-software failures with domain-specific resource behavior in FGCS. We describe the prediction framework and its implementation in a production FGCS system named iShare. Through experiments on an iShare testbed, we demonstrate that the prediction achieves accuracy above 86% on average and outperforms linear time series models, while the computational cost is negligible. Our experimental results also show that the prediction is robust in the presence of irregular resource unavailability.
Keywords :
Internet; Markov processes; fault tolerant computing; resource allocation; domain-specific resource behavior; fault tolerance; fine-grained cycle sharing system; generic hardware-software failure; iShare FGCS system; resource availability prediction; semi-Markov process; Accuracy; Availability; Central Processing Unit; Computational efficiency; Computational modeling; Predictive models; Production systems; Testing
fLanguage :
English
Publisher :
IEEE
Conference_Title :
High Performance Distributed Computing, 2006 15th IEEE International Symposium on
Conference_Location :
Paris
ISSN :
1082-8907
Print_ISBN :
1-4244-0307-3
Type :
conf
DOI :
10.1109/HPDC.2006.1652140
Filename :
1652140