• DocumentCode
    1820012
  • Title

    Data mining approaches to software fault diagnosis

  • Author

    Bose, R. P Jagadeesh Chandra ; Srinivasan, S.H.

  • Author_Institution
    Appl. Res. Group, Satyam Comput. Services Ltd, Bangalore, India
  • fYear
    2005
  • fDate
    3-4 April 2005
  • Firstpage
    45
  • Lastpage
    52
  • Abstract
    Automatic identification of software faults has enormous practical significance. This requires characterizing program execution behavior and the use of appropriate data mining techniques on the chosen representation. In this paper we use the sequence of system calls to characterize program execution. The data mining tasks addressed are learning to map system call streams to fault labels and automatic identification of fault causes. Spectrum kernels and SVM are used for the former while latent semantic analysis is used for the latter The techniques are demonstrated for the intrusion dataset containing system call traces. The results show that kernel techniques are as accurate as the best available results but are faster by orders of magnitude. We also show that latent semantic indexing is capable of revealing fault-specific features.
  • Keywords
    data mining; fault diagnosis; software reliability; system monitoring; SVM; automatic software fault identification; data mining techniques; intrusion dataset; latent semantic indexing; program execution behavior; software fault diagnosis; spectrum kernels; support vector machines; system call streams; system call traces; system calls; Data mining; Databases; Fault detection; Fault diagnosis; Indexing; Kernel; Robustness; Search engines; Support vector machines; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Issues in Data Engineering: Stream Data Mining and Applications, 2005. RIDE-SDMA 2005. 15th International Workshop on
  • ISSN
    1097-8585
  • Print_ISBN
    0-7695-2390-0
  • Type

    conf

  • DOI
    10.1109/RIDE.2005.9
  • Filename
    1498230