DocumentCode
3076092
Title
Improving the Precision of Dependence-Based Defect Mining by Supervised Learning of Rule and Violation Graphs
Author
Sun, Boya ; Podgurski, Andy ; Ray, Soumya
Author_Institution
EECS Dept., Case Western Reserve Univ., Cleveland, OH, USA
fYear
2010
fDate
1-4 Nov. 2010
Firstpage
1
Lastpage
10
Abstract
Previous work has shown that application of graph mining techniques to system dependence graphs improves the precision of automatic defect discovery by revealing subgraphs corresponding to implicit programming rules and to rule violations. However, developers must still confirm, edit, or discard reported rules and violations, which is both costly and error-prone. In order to reduce developer effort and further improve precision, we investigate the use of supervised learning models for classifying and ranking rule and violation subgraphs. In particular, we present and evaluate logistic regression models for rules and violations, respectively, which are based on general dependence-graph features. Our empirical results indicate that (i) use of these models can significantly improve the precision and recall of defect discovery, and (ii) our approach is superior to existing heuristic approaches to rule and violation ranking and to an existing static-warning classifier, and (iii) accurate models can be learned using only a few labeled examples.
Keywords
data mining; graphs; learning (artificial intelligence); pattern classification; regression analysis; automatic defect discovery; dependence based defect mining; dependence graph feature; graph mining technique; implicit programming rule; logistic regression model; static warning classifier; supervised learning; violation graph; violation ranking; Classification algorithms; Computational modeling; Computer bugs; Data mining; Logistics; Measurement; Programming; defect classification; defect mining; dependence graph; logistic regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Reliability Engineering (ISSRE), 2010 IEEE 21st International Symposium on
Conference_Location
San Jose, CA
ISSN
1071-9458
Print_ISBN
978-1-4244-9056-1
Electronic_ISBN
1071-9458
Type
conf
DOI
10.1109/ISSRE.2010.37
Filename
5635105
Link To Document