DocumentCode :
1820012
Title :
Data mining approaches to software fault diagnosis
Author :
Bose, R. P Jagadeesh Chandra ; Srinivasan, S.H.
Author_Institution :
Appl. Res. Group, Satyam Comput. Services Ltd, Bangalore, India
fYear :
2005
fDate :
3-4 April 2005
Firstpage :
45
Lastpage :
52
Abstract :
Automatic identification of software faults has enormous practical significance. This requires characterizing program execution behavior and the use of appropriate data mining techniques on the chosen representation. In this paper we use the sequence of system calls to characterize program execution. The data mining tasks addressed are learning to map system call streams to fault labels and automatic identification of fault causes. Spectrum kernels and SVM are used for the former while latent semantic analysis is used for the latter The techniques are demonstrated for the intrusion dataset containing system call traces. The results show that kernel techniques are as accurate as the best available results but are faster by orders of magnitude. We also show that latent semantic indexing is capable of revealing fault-specific features.
Keywords :
data mining; fault diagnosis; software reliability; system monitoring; SVM; automatic software fault identification; data mining techniques; intrusion dataset; latent semantic indexing; program execution behavior; software fault diagnosis; spectrum kernels; support vector machines; system call streams; system call traces; system calls; Data mining; Databases; Fault detection; Fault diagnosis; Indexing; Kernel; Robustness; Search engines; Support vector machines; Visualization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Research Issues in Data Engineering: Stream Data Mining and Applications, 2005. RIDE-SDMA 2005. 15th International Workshop on
ISSN :
1097-8585
Print_ISBN :
0-7695-2390-0
Type :
conf
DOI :
10.1109/RIDE.2005.9
Filename :
1498230
Link To Document :
بازگشت