DocumentCode
1820012
Title
Data mining approaches to software fault diagnosis
Author
Bose, R. P Jagadeesh Chandra ; Srinivasan, S.H.
Author_Institution
Appl. Res. Group, Satyam Comput. Services Ltd, Bangalore, India
fYear
2005
fDate
3-4 April 2005
Firstpage
45
Lastpage
52
Abstract
Automatic identification of software faults has enormous practical significance. This requires characterizing program execution behavior and the use of appropriate data mining techniques on the chosen representation. In this paper we use the sequence of system calls to characterize program execution. The data mining tasks addressed are learning to map system call streams to fault labels and automatic identification of fault causes. Spectrum kernels and SVM are used for the former while latent semantic analysis is used for the latter The techniques are demonstrated for the intrusion dataset containing system call traces. The results show that kernel techniques are as accurate as the best available results but are faster by orders of magnitude. We also show that latent semantic indexing is capable of revealing fault-specific features.
Keywords
data mining; fault diagnosis; software reliability; system monitoring; SVM; automatic software fault identification; data mining techniques; intrusion dataset; latent semantic indexing; program execution behavior; software fault diagnosis; spectrum kernels; support vector machines; system call streams; system call traces; system calls; Data mining; Databases; Fault detection; Fault diagnosis; Indexing; Kernel; Robustness; Search engines; Support vector machines; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Research Issues in Data Engineering: Stream Data Mining and Applications, 2005. RIDE-SDMA 2005. 15th International Workshop on
ISSN
1097-8585
Print_ISBN
0-7695-2390-0
Type
conf
DOI
10.1109/RIDE.2005.9
Filename
1498230
Link To Document