DocumentCode :
635204
Title :
It´s not a bug, it´s a feature: How misclassification impacts bug prediction
Author :
Herzig, Kim ; Just, Sascha ; Zeller, A.
Author_Institution :
Saarland Univ., Saarbrücken, Germany
fYear :
2013
fDate :
18-26 May 2013
Firstpage :
392
Lastpage :
401
Abstract :
In a manual examination of more than 7,000 issue reports from the bug databases of five open-source projects, we found 33.8% of all bug reports to be misclassified - that is, rather than referring to a code fix, they resulted in a new feature, an update to documentation, or an internal refactoring. This misclassification introduces bias in bug prediction models, confusing bugs and features: On average, 39% of files marked as defective actually never had a bug. We discuss the impact of this misclassification on earlier studies and recommend manual data validation for future studies.
Keywords :
data mining; program debugging; software maintenance; bug prediction model; bug reports misclassification; documentation; internal refactoring; Computer bugs; Databases; Documentation; Inspection; Maintenance engineering; Manuals; Noise; Mining software repositories; bias; bug reports; data quality; noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering (ICSE), 2013 35th International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
978-1-4673-3073-2
Type :
conf
DOI :
10.1109/ICSE.2013.6606585
Filename :
6606585
Link To Document :
بازگشت