DocumentCode
2621028
Title
Detecting Software Faults in Distributed Systems
Author
Burrell, A.T. ; Papantoni-Kazakos, P.
Author_Institution
Comput. Sci. Dept., Oklahoma State Univ., Stillwater, OK, USA
Volume
7
fYear
2009
fDate
March 31 2009-April 2 2009
Firstpage
300
Lastpage
304
Abstract
We are concerned with the problem of detecting faults in distributed software rapidly and accurately. We assume that the software is characterized by events or attributes, which determine operational modes; some of these modes may be identified as failures. We assume that these events are known and that the probabilistic structure of their chronological evolution is also known, for a finite set of different operational modes. We propose and analyze a sequential algorithm that detects changes in operational modes rapidly and reliably. Furthermore, a threshold operational parameter of the algorithm effectively controls the induced tradeoff among detection speed, correct detection, and false detection.
Keywords
parallel algorithms; probability; program diagnostics; software fault tolerance; software maintenance; change detection; chronological evolution; distributed system; operational mode; parallel algorithm; probabilistic structure; sequential algorithm; software fault detection; software reliability; threshold operational parameter; Algorithm design and analysis; Change detection algorithms; Computer science; Electrical fault detection; Fault detection; Measurement; Parallel algorithms; Stochastic processes; Fault recognition; detection of change; sequential algorithms;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Science and Information Engineering, 2009 WRI World Congress on
Conference_Location
Los Angeles, CA
Print_ISBN
978-0-7695-3507-4
Type
conf
DOI
10.1109/CSIE.2009.464
Filename
5170330