• DocumentCode
    3076092
  • Title

    Improving the Precision of Dependence-Based Defect Mining by Supervised Learning of Rule and Violation Graphs

  • Author

    Sun, Boya ; Podgurski, Andy ; Ray, Soumya

  • Author_Institution
    EECS Dept., Case Western Reserve Univ., Cleveland, OH, USA
  • fYear
    2010
  • fDate
    1-4 Nov. 2010
  • Firstpage
    1
  • Lastpage
    10
  • Abstract
    Previous work has shown that application of graph mining techniques to system dependence graphs improves the precision of automatic defect discovery by revealing subgraphs corresponding to implicit programming rules and to rule violations. However, developers must still confirm, edit, or discard reported rules and violations, which is both costly and error-prone. In order to reduce developer effort and further improve precision, we investigate the use of supervised learning models for classifying and ranking rule and violation subgraphs. In particular, we present and evaluate logistic regression models for rules and violations, respectively, which are based on general dependence-graph features. Our empirical results indicate that (i) use of these models can significantly improve the precision and recall of defect discovery, and (ii) our approach is superior to existing heuristic approaches to rule and violation ranking and to an existing static-warning classifier, and (iii) accurate models can be learned using only a few labeled examples.
  • Keywords
    data mining; graphs; learning (artificial intelligence); pattern classification; regression analysis; automatic defect discovery; dependence based defect mining; dependence graph feature; graph mining technique; implicit programming rule; logistic regression model; static warning classifier; supervised learning; violation graph; violation ranking; Classification algorithms; Computational modeling; Computer bugs; Data mining; Logistics; Measurement; Programming; defect classification; defect mining; dependence graph; logistic regression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Reliability Engineering (ISSRE), 2010 IEEE 21st International Symposium on
  • Conference_Location
    San Jose, CA
  • ISSN
    1071-9458
  • Print_ISBN
    978-1-4244-9056-1
  • Electronic_ISBN
    1071-9458
  • Type

    conf

  • DOI
    10.1109/ISSRE.2010.37
  • Filename
    5635105