• DocumentCode
    1333174
  • Title

    Finding Defective Software Modules by Means of Data Mining Techniques

  • Author

    Riquelme, J.C. ; Ruiz, Ricardo ; Rodriguez, D. ; Aguilar-Ruiz, J.S.

  • Author_Institution
    Dept. de Lenguajes y Sist., Univ. de Sevilla, Sevilla, Spain
  • Volume
    7
  • Issue
    3
  • fYear
    2009
  • fDate
    7/1/2009 12:00:00 AM
  • Firstpage
    377
  • Lastpage
    382
  • Abstract
    The characterization of defective modules in software engineering remains a challenge. In this work, we use data mining techniques to search for rules that indicate modules with a high probability of being defective. Using datasets from the PROMISE repository 1, we first applied feature selection to work only with those attributes from the datasets capable of predicting defective modules. Then, a genetic algorithm search for rules characterising subgroups with a high probability of being defective. This algorithm overcomes the problem of unbalanced datasets where the number of non-defective samples in the dataset highly outnumbers the defective ones.
  • Keywords
    data mining; genetic algorithms; probability; program diagnostics; search problems; software fault tolerance; PROMISE repository; data mining technique; feature selection; genetic algorithm search; nondefective sample; probability; software defect detection; software engineering; software module; unbalanced dataset; Data mining; Genetic algorithms; Predictive models; Silicon compounds; Software engineering; Defect detection and defect prediction in software modules; data mining;
  • fLanguage
    English
  • Journal_Title
    Latin America Transactions, IEEE (Revista IEEE America Latina)
  • Publisher
    ieee
  • ISSN
    1548-0992
  • Type

    jour

  • DOI
    10.1109/TLA.2009.5336637
  • Filename
    5336637