• DocumentCode
    625525
  • Title

    Multi-objective Cross-Project Defect Prediction

  • Author

    Canfora, Gerardo ; De Lucia, Andrea ; Di Penta, Massimiliano ; Oliveto, Rocco ; Panichella, A. ; Panichella, Sebastiano

  • Author_Institution
    Univ. of Sannio, Benevento, Italy
  • fYear
    2013
  • fDate
    18-22 March 2013
  • Firstpage
    252
  • Lastpage
    261
  • Abstract
    Cross-project defect prediction is very appealing because (i) it allows predicting defects in projects for which the availability of data is limited, and (ii) it allows producing generalizable prediction models. However, existing research suggests that cross-project prediction is particularly challenging and, due to heterogeneity of projects, prediction accuracy is not always very good. This paper proposes a novel, multi-objective approach for cross-project defect prediction, based on a multi-objective logistic regression model built using a genetic algorithm. Instead of providing the software engineer with a single predictive model, the multi-objective approach allows software engineers to choose predictors achieving a compromise between number of likely defect-prone artifacts (effectiveness) and LOC to be analyzed/tested (which can be considered as a proxy of the cost of code inspection). Results of an empirical evaluation on 10 datasets from the Promise repository indicate the superiority and the usefulness of the multi-objective approach with respect to single-objective predictors. Also, the proposed approach outperforms an alternative approach for cross-project prediction, based on local prediction upon clusters of similar classes.
  • Keywords
    genetic algorithms; pattern clustering; regression analysis; search problems; software engineering; software management; GA; LOC analysis; LOC testing; Promise repository; defect-prone artifacts; generalizable prediction models; genetic algorithm; multiobjective cross-project defect prediction accuracy; multiobjective logistic regression model; project heterogeneity; software engineers; Accuracy; Data models; Inspection; Logistics; Measurement; Predictive models; Software; Cross-project defect prediction; multi-objective optimization; search-based software engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Testing, Verification and Validation (ICST), 2013 IEEE Sixth International Conference on
  • Conference_Location
    Luembourg
  • Print_ISBN
    978-1-4673-5961-0
  • Type

    conf

  • DOI
    10.1109/ICST.2013.38
  • Filename
    6569737