مرکز منطقه ای اطلاع رساني علوم و فناوري - Rigorous Performance Evaluation of Self-Stabilization Using Probabilistic Model Checking

DocumentCode :

2108448

Title :

Rigorous Performance Evaluation of Self-Stabilization Using Probabilistic Model Checking

Author :

Fallahi, Naser ; Bonakdarpour, Borzoo ; Tixeuil, Sebastien

Author_Institution :

Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada

fYear :

2013

fDate :

Sept. 30 2013-Oct. 3 2013

Firstpage :

153

Lastpage :

162

Abstract :

We propose a new metric for effectively and accurately evaluating the performance of self-stabilizing algorithms. Self-stabilization is a versatile category of fault-tolerance that guarantees system recovery to normal behavior within a finite number of steps, when the state of the system is perturbed by transient faults (or equally, the initial state of the system can be some arbitrary state). The performance of self-stabilizing algorithms is conventionally characterized in the literature by asymptotic computation complexity. We argue that such characterization of performance is too abstract and does not reflect accurately the realities of deploying a distributed algorithm in practice. Our new metric for characterizing the performance of self-stabilizing algorithms is the expected mean value of recovery time. Our metric has several crucial features. Firstly, it encodes accurate average case speed of recovery. Secondly, we show that our evaluation method can effectively incorporate several other parameters that are of importance in practice and have no place in asymptotic computation complexity. Examples include the type of distributed scheduler, likelihood of occurrence of faults, the impact of faults on speed of recovery, and network topology. We utilize a deep analysis technique, namely, probabilistic model checking to rigorously compute our proposed metric. All our claims are backed by detailed case studies and experiments.

Keywords :

computational complexity; distributed algorithms; probability; program verification; scheduling; software fault tolerance; software performance evaluation; system recovery; asymptotic computation complexity; deep analysis technique; distributed scheduler; fault occurrence likelihood; fault-tolerance; network topology; performance evaluation; probabilistic model checking; self-stabilization; self-stabilizing algorithms; system recovery; transient faults; Algorithm design and analysis; Distributed algorithms; Educational institutions; Markov processes; Measurement; Model checking; Probabilistic logic; Performance evaluation; Self-stabilization; formal methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Reliable Distributed Systems (SRDS), 2013 IEEE 32nd International Symposium on

Conference_Location :

Braga

Type :

conf

DOI :

10.1109/SRDS.2013.24

Filename :

6656271

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2108448