Title :
Multi-objective Cross-Project Defect Prediction
Author :
Canfora, Gerardo ; De Lucia, Andrea ; Di Penta, Massimiliano ; Oliveto, Rocco ; Panichella, A. ; Panichella, Sebastiano
Author_Institution :
Univ. of Sannio, Benevento, Italy
Abstract :
Cross-project defect prediction is very appealing because (i) it allows predicting defects in projects for which the availability of data is limited, and (ii) it allows producing generalizable prediction models. However, existing research suggests that cross-project prediction is particularly challenging and, due to heterogeneity of projects, prediction accuracy is not always very good. This paper proposes a novel, multi-objective approach for cross-project defect prediction, based on a multi-objective logistic regression model built using a genetic algorithm. Instead of providing the software engineer with a single predictive model, the multi-objective approach allows software engineers to choose predictors achieving a compromise between number of likely defect-prone artifacts (effectiveness) and LOC to be analyzed/tested (which can be considered as a proxy of the cost of code inspection). Results of an empirical evaluation on 10 datasets from the Promise repository indicate the superiority and the usefulness of the multi-objective approach with respect to single-objective predictors. Also, the proposed approach outperforms an alternative approach for cross-project prediction, based on local prediction upon clusters of similar classes.
Keywords :
genetic algorithms; pattern clustering; regression analysis; search problems; software engineering; software management; GA; LOC analysis; LOC testing; Promise repository; defect-prone artifacts; generalizable prediction models; genetic algorithm; multiobjective cross-project defect prediction accuracy; multiobjective logistic regression model; project heterogeneity; software engineers; Accuracy; Data models; Inspection; Logistics; Measurement; Predictive models; Software; Cross-project defect prediction; multi-objective optimization; search-based software engineering;
Conference_Titel :
Software Testing, Verification and Validation (ICST), 2013 IEEE Sixth International Conference on
Conference_Location :
Luembourg
Print_ISBN :
978-1-4673-5961-0
DOI :
10.1109/ICST.2013.38