DocumentCode :
2891906
Title :
The State of Machine Learning Methodology in Software Fault Prediction
Author :
Hall, T. ; Bowes, D.
Author_Institution :
Dept. of Inf. Syst. & Comput., Brunel Univ., Uxbridge, UK
Volume :
2
fYear :
2012
fDate :
12-15 Dec. 2012
Firstpage :
308
Lastpage :
313
Abstract :
The aim of this paper is to investigate the quality of methodology in software fault prediction studies using machine learning. Over two hundred studies of fault prediction have been published in the last 10 years. There is evidence to suggest that the quality of methodology used in some of these studies does not allow us to have confidence in the predictions reported by them. We evaluate the machine learning methodology used in 21 fault prediction studies. All of these studies use NASA data sets. We score each study from 1 to 10 in terms of the quality of their machine learning methodology (e.g. whether or not studies report randomising their cross validation folds). Only 10 out of the 21 studies scored 5 or more out of 10. Furthermore 1 study scored only 1 out of 10. When we plot these scores over time there is no evidence that the quality of machine learning methodology is better in recent studies. Our results suggest that there remains much to be done by both researchers and reviewers to improve the quality of machine learning methodology used in software fault prediction. We conclude that the results reported in some studies need to be treated with caution.
Keywords :
learning (artificial intelligence); program testing; software fault tolerance; machine learning methodology; software fault prediction; Cleaning; Data models; Machine learning; NASA; Predictive models; Software; Software engineering; experimental techniques; fault prediction; machine learning; methodology; software engineering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
Type :
conf
DOI :
10.1109/ICMLA.2012.226
Filename :
6406713
Link To Document :
بازگشت