DocumentCode
3072434
Title
A Taxonomy of Data Quality Challenges in Empirical Software Engineering
Author
Bosu, Michael Franklin ; Macdonell, Stephen G.
Author_Institution
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland, New Zealand
fYear
2013
fDate
4-7 June 2013
Firstpage
97
Lastpage
106
Abstract
Reliable empirical models such as those used in software effort estimation or defect prediction are inherently dependent on the data from which they are built. As demands for process and product improvement continue to grow, the quality of the data used in measurement and prediction systems warrants increasingly close scrutiny. In this paper we propose a taxonomy of data quality challenges in empirical software engineering, based on an extensive review of prior research. We consider current assessment techniques for each quality issue and proposed mechanisms to address these issues, where available. Our taxonomy classifies data quality issues into three broad areas: first, characteristics of data that mean they are not fit for modeling, second, data set characteristics that lead to concerns about the suitability of applying a given model to another data set, and third, factors that prevent or limit data accessibility and trust. We identify this latter area as of particular need in terms of further research.
Keywords
software cost estimation; software fault tolerance; software quality; assessment techniques; data accessibility; data quality; data set characteristics; defect prediction; empirical software engineering; product improvement; reliable empirical models; software effort estimation; trust; Accuracy; Data models; Measurement; Noise; Software; Software engineering; Taxonomy; accessibility; commercial sensitivity; data quality; empirical software engineering; provenance; trustworthiness;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering Conference (ASWEC), 2013 22nd Australian
Conference_Location
Melbourne, VIC
ISSN
1530-0803
Type
conf
DOI
10.1109/ASWEC.2013.21
Filename
6601297
Link To Document