DocumentCode :
525638
Title :
A data mining based method: Detecting software defects in source code
Author :
Zhang, Yuehua ; Liu, Ying ; Zhang, Lingling ; Shi, Yong
fYear :
2010
fDate :
23-25 June 2010
Firstpage :
607
Lastpage :
612
Abstract :
With the expansion of software size and complexity, how to detect defects becomes a challenging problem. This paper proposes a defect detection method which applies data mining techniques in source code to detect two types of defects in one process. The two types of defects are rule-violating defects and copy-paste related defects which may include semantic defects. During the process, this method can also extract implicit programming rules without prior knowledge of the software and detect copy-paste segments with different granularities. The method is evaluated with the Linux kernel that contains more than 4 million lines of C code. The result shows that the resulting system can quickly detect many programming rules and violations to the rules. After using the novel pruning techniques, it will greatly reduce the effort of manually checking violations so as a large number of false positives are effectively eliminated. As an illustrative example of its effectiveness, a case study shows that among the top 50 violations reported by the proposed model, 11 defects can be confirmed after examining the source code.
Keywords :
C language; Linux; data mining; program testing; software metrics; software quality; source coding; C code; Linux kernel; copy-paste related defect; data mining; programming rule; rule-violating defect; software complexity; software defect detection; source code; Computer industry; Data mining; Educational institutions; Functional programming; Information science; Job shop scheduling; Kernel; Linux; Mining industry; Programming profession; copy-paste; data mining; defect detection; false positive; programming rule;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering and Data Mining (SEDM), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-7324-3
Electronic_ISBN :
978-89-88678-22-0
Type :
conf
Filename :
5542852
Link To Document :
بازگشت