DocumentCode
2891906
Title
The State of Machine Learning Methodology in Software Fault Prediction
Author
Hall, T. ; Bowes, D.
Author_Institution
Dept. of Inf. Syst. & Comput., Brunel Univ., Uxbridge, UK
Volume
2
fYear
2012
fDate
12-15 Dec. 2012
Firstpage
308
Lastpage
313
Abstract
The aim of this paper is to investigate the quality of methodology in software fault prediction studies using machine learning. Over two hundred studies of fault prediction have been published in the last 10 years. There is evidence to suggest that the quality of methodology used in some of these studies does not allow us to have confidence in the predictions reported by them. We evaluate the machine learning methodology used in 21 fault prediction studies. All of these studies use NASA data sets. We score each study from 1 to 10 in terms of the quality of their machine learning methodology (e.g. whether or not studies report randomising their cross validation folds). Only 10 out of the 21 studies scored 5 or more out of 10. Furthermore 1 study scored only 1 out of 10. When we plot these scores over time there is no evidence that the quality of machine learning methodology is better in recent studies. Our results suggest that there remains much to be done by both researchers and reviewers to improve the quality of machine learning methodology used in software fault prediction. We conclude that the results reported in some studies need to be treated with caution.
Keywords
learning (artificial intelligence); program testing; software fault tolerance; machine learning methodology; software fault prediction; Cleaning; Data models; Machine learning; NASA; Predictive models; Software; Software engineering; experimental techniques; fault prediction; machine learning; methodology; software engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location
Boca Raton, FL
Print_ISBN
978-1-4673-4651-1
Type
conf
DOI
10.1109/ICMLA.2012.226
Filename
6406713
Link To Document