• DocumentCode
    60753
  • Title

    Identifying Recurring Faulty Functions in Field Traces of a Large Industrial Software System

  • Author

    Murtaza, Syed Shariyar ; Madhavji, Nazim H. ; Gittens, Mechelle ; Hamou-Lhadj, Abdelwahab

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Concordia Univ., Montreal, QC, Canada
  • Volume
    64
  • Issue
    1
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    269
  • Lastpage
    283
  • Abstract
    Software maintainers use the traces of field failures to understand and diagnose faulty functions that cause the system to fail. Despite their usefulness, traces from the field can be quite overwhelming, especially for software systems with a vast client base. In the execution of realistic applications, many of them being millions of lines of code, there are just too many traces that are generated. In addition, traces are known to be extraordinarily large, which further complicates matters. Fortunately, not all field failures are caused by new faults. In fact, previous studies showed that 50% to 90% of field failures are due to previously known faults. In this paper, we propose a machine learning approach that automatically detects recurring faulty functions in the traces of new field failures. We achieve our goal by training decision trees on earlier resolved traces of system failures from the current and prior releases of the system. When applied to a large industrial system with 20 million lines of code and 200,000 functions, our approach was able to detect recurring faulty functions in the traces of field failures with an accuracy of 90%, to even 97% in some cases.
  • Keywords
    software fault tolerance; software maintenance; fault function diagnosis; large industrial software system; recurring faulty function identification; software maintenance; Computer crashes; Decision trees; Fault diagnosis; Libraries; Software systems; Training; Crashing failures; decision tree; function call traces; non-crashing failures; recurrent faults; software maintenance;
  • fLanguage
    English
  • Journal_Title
    Reliability, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9529
  • Type

    jour

  • DOI
    10.1109/TR.2014.2366274
  • Filename
    6967866