DocumentCode :
2877862
Title :
Mining for statistical models of availability in large-scale distributed systems: An empirical study of SETI@home
Author :
Javadi, Bahman ; Kondo, Derrick ; Vincent, Jean-Marc ; Anderson, David P.
Author_Institution :
INRIA, Sophia-Antipolis, France
fYear :
2009
fDate :
21-23 Sept. 2009
Firstpage :
1
Lastpage :
10
Abstract :
In the age of cloud, Grid, P2P, and volunteer distributed computing, large-scale systems with tens of thousands of unreliable hosts are increasingly common. Invariably, these systems are composed of heterogeneous hosts whose individual availability often exhibit different statistical properties (for example stationary versus non-stationary behavior) and fit different models (for example Exponential, Weibull, or Pareto probability distributions). In this paper, we describe an effective method for discovering subsets of hosts whose availability have similar statistical properties and can be modelled with similar probability distributions. We apply this method with about 230,000 host availability traces obtained from a real large-scale Internet-distributed system, namely SETI@home. We find that about 34% of hosts exhibit availability that is a truly random process, and that these hosts can often be modelled accurately with a few distinct distributions from different families. We believe that this characterization is fundamental in the design of stochastic scheduling algorithms across large-scale systems where host availability is uncertain.
Keywords :
Internet; data mining; random processes; scheduling; statistical distributions; stochastic processes; P2P computing; cloud computing; grid computing; heterogeneous hosts; host availability traces; large-scale Internet distributed system; probability distribution; random process; statistical model; stochastic scheduling algorithm; volunteer distributed computing; Algorithm design and analysis; Availability; Clouds; Distributed computing; Hardware; Internet; Java; Large-scale systems; Probability distribution; Random processes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Modeling, Analysis & Simulation of Computer and Telecommunication Systems, 2009. MASCOTS '09. IEEE International Symposium on
Conference_Location :
London
ISSN :
1526-7539
Print_ISBN :
978-1-4244-4927-9
Electronic_ISBN :
1526-7539
Type :
conf
DOI :
10.1109/MASCOT.2009.5367061
Filename :
5367061
Link To Document :
بازگشت