Title of article
Class noise detection based on software metrics and ROC curves
Author/Authors
Cagatay Catal، نويسنده , , Oral Alan، نويسنده , , Kerime Balkan، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2011
Pages
11
From page
4867
To page
4877
Abstract
Noise detection for software measurement datasets is a topic of growing interest. The presence of class and attribute noise in software measurement datasets degrades the performance of machine learning-based classifiers, and the identification of these noisy modules improves the overall performance. In this study, we propose a noise detection algorithm based on software metrics threshold values. The threshold values are obtained from the Receiver Operating Characteristic (ROC) analysis. This paper focuses on case studies of five public NASA datasets and details the construction of Naive Bayes-based software fault prediction models both before and after applying the proposed noise detection algorithm. Experimental results show that this noise detection approach is very effective for detecting the class noise and that the performance of fault predictors using a Naive Bayes algorithm with a logNum filter improves if the class labels of identified noisy modules are corrected.
Keywords
software quality , Noise detection , Receiver operating characteristic (ROC) curve , Software Metrics , Software fault prediction , Metric threshold values
Journal title
Information Sciences
Serial Year
2011
Journal title
Information Sciences
Record number
1214715
Link To Document