• DocumentCode
    2210862
  • Title

    Improving software fault-prediction for imbalanced data

  • Author

    Shatnawi, Raed

  • Author_Institution
    Software Eng. Dept., Jordan Univ. of Sci. & Technol., Irbid, Jordan
  • fYear
    2012
  • fDate
    18-20 March 2012
  • Firstpage
    54
  • Lastpage
    59
  • Abstract
    Fault-proneness has been studied extensively as a quality factor. The prediction of fault-proneness of software modules can help software engineers to plan evolutions of the system. This plan can be compromised in case prediction models are biased or do not have high prediction performance. One major issue that can impact the prediction performance is the fault distributions such as the data imbalance, i.e., the majority of modules are faultless whereas the minority of modules is only faulty. In this paper, we propose to use the fault content (i.e., the number of faults in a module) to oversample the minority. We applied this technique on a large object-oriented system - Eclipse. The proposed oversampling is tested on three classifiers. The results have shown a better prediction performance than other traditional oversampling techniques. The oversampling technique is more convenient than other sampling techniques because it´s guided by information provided from the software history.
  • Keywords
    data handling; object-oriented programming; software fault tolerance; Eclipse; fault distributions; fault-proneness; imbalanced data; object-oriented system; oversampling technique; software fault-prediction; software history; software modules; Data models; Measurement; Object oriented modeling; Predictive models; Software engineering; Software quality; CK metrics; ROC curve; data mining; fault-proneness; imbalanced data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Information Technology (IIT), 2012 International Conference on
  • Conference_Location
    Abu Dhabi
  • Print_ISBN
    978-1-4673-1100-7
  • Type

    conf

  • DOI
    10.1109/INNOVATIONS.2012.6207774
  • Filename
    6207774