• DocumentCode
    588645
  • Title

    Detection of Software Failures through Event Logs: An Experimental Study

  • Author

    Pecchia, Antonio ; Russo, S.

  • Author_Institution
    Dipt. di Inf. e Sist., Univ. degli Studi di Napoli Federico II, Naples, Italy
  • fYear
    2012
  • fDate
    27-30 Nov. 2012
  • Firstpage
    31
  • Lastpage
    40
  • Abstract
    Software faults are recognized to be among the main responsible for system failures in many application domains. Event logs play a key role to support the analysis of failures occurring under real workload conditions. Nevertheless, field experience suggests that event logs may be inaccurate at reporting software failures or they fail to provide accurate support for understanding their causes. This paper analyzes the factors that determine accurate detection of software failures through event logs. The study is based on a data set of 17,387 experiments where failures have been induced by means of software fault injection into three systems. Analysis reveals that the reporting ability of logs collected during the experiments, is not influenced by the type of fault that is activated at runtime. More importantly, analysis demonstrates that, despite the considered systems adopt very similar detection mechanisms, the ability of logs at reporting a given type of failure changes significantly across the systems. A closer inspection of collected logs reveals that characteristics, such as system architecture, placement of the logging instructions and specific supports provided by the execution environment, significantly increase accuracy of logs at runtime.
  • Keywords
    software architecture; software fault tolerance; event log reporting ability; execution environment; logging instruction placement; software failure detection; software fault injection; software fault recognition; system architecture; system software failure analysis; workload conditions; Analysis of variance; Detectors; Radio frequency; Runtime; Software systems; Web servers; ODC; detection mechanism; event log; failure; software fault;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Reliability Engineering (ISSRE), 2012 IEEE 23rd International Symposium on
  • Conference_Location
    Dallas, TX
  • ISSN
    1071-9458
  • Print_ISBN
    978-1-4673-4638-2
  • Type

    conf

  • DOI
    10.1109/ISSRE.2012.24
  • Filename
    6405402