• DocumentCode
    2621028
  • Title

    Detecting Software Faults in Distrubted Systems

  • Author

    Burrell, A.T. ; Papantoni-Kazakos, P.

  • Author_Institution
    Comput. Sci. Dept., Oklahoma State Univ., Stillwater, OK, USA
  • Volume
    7
  • fYear
    2009
  • fDate
    March 31 2009-April 2 2009
  • Firstpage
    300
  • Lastpage
    304
  • Abstract
    We are concerned with the problem of detecting faults in distributed software, rapidly and accurately. We assume that the software is characterized by events or attributes, which determine operational modes; some of these modes may be identified as failures. We assume that these events are known and that their probabilistic structure, in their chronological evolution, is also known, for a finite set of different operational modes. We propose and analyze a sequential algorithm that detects changes in operational modes rapidly and reliably. Further more, a threshold operational parameter of the algorithm controls effectively the induced speed versus correct detection versus false detection tradeoff.
  • Keywords
    parallel algorithms; probability; program diagnostics; software fault tolerance; software maintenance; change detection; chronological evolution; distributed system; operational mode; parallel algorithm; probabilistic structure; sequential algorithm; software fault detection; software reliability; threshold operational parameter; Algorithm design and analysis; Change detection algorithms; Computer science; Electrical fault detection; Fault detection; Measurement; Parallel algorithms; Stochastic processes; Fault recognition; detection of change; sequential algorithms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Engineering, 2009 WRI World Congress on
  • Conference_Location
    Los Angeles, CA
  • Print_ISBN
    978-0-7695-3507-4
  • Type

    conf

  • DOI
    10.1109/CSIE.2009.464
  • Filename
    5170330