Title :
Assessing risk in Grids at resource level considering Grid resources as repairable using two state Semi Markov model
Author :
Sangrasi, Asif ; Djemame, Karim
Author_Institution :
Sch. of Comput., Univ. of Leeds, Leeds, UK
Abstract :
Service Level Agreements in Grids improve upon the Best Effort Approach which provides no guarantees for provision of any Quality of Service (QoS) between the End User and the Resource Provider. Risk Assessment in Grids improves upon SLA by provision of Risk information to resource provider. Most of the previous studies of Risk Assessment in Grids work at node level. As a node failure can be a failure of any component such as Disk, CPU, Memory, Software, etcetera, the risk assessment at component level in Grids was introduced. In this work, we propose a Risk Assessment Model at component level while considering Grid resources as repairable. This work can be differentiated from the other works by the fact that the past efforts in Risk Assessment in Grids consider Grid Resources as replaceable rather than repairable. This Semi Markov model relies on the distribution fitting for both time to Failure and Time to Repair, extracted from the Grid Failure data during the data analysis section. By using Grid Failure data, the utilization of this Grid model is demonstrated by providing (Probability of Failure) PoF and (Probability of Repair) PoR values for different components. The experimental results indicate the PoF and PoR behave vary differently with the latter showing considerably times required for repair as compared to expectance of a failure. The risk information provided by this Risk Assessment Model will help Resource provider to use the Grid Resources efficiently and achieve effective scheduling.
Keywords :
Markov processes; grid computing; probability; quality of service; risk management; software maintenance; system recovery; PoF; PoR; QoS; SLA; best effort approach; component level; data analysis section; distribution fitting; end user; grid failure data; grid model; node failure; node level; probability of failure; probability of repair; quality of service; repairable grid resources; resource level; resource provider; risk assessment model; risk information; scheduling; service level agreement; time-to-failure; time-to-repair; two state semi Markov model; Data models; Fitting; Maintenance engineering; Markov processes; Mathematical model; Risk management; Software; component level; grid computing; repairable resources; risk assessment; risk model;
Conference_Titel :
Digital Ecosystems Technologies (DEST), 2012 6th IEEE International Conference on
Conference_Location :
Campione d´Italia
Print_ISBN :
978-1-4673-1702-3
Electronic_ISBN :
2150-4938
DOI :
10.1109/DEST.2012.6227906