DocumentCode :
2025180
Title :
Finding predictors of field defects for open source software systems in commonly available data sources: a case study of OpenBSD
Author :
Li, Paul Luo ; Herbsleb, Jim ; Shaw, Mary
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA
fYear :
2005
fDate :
1-1 Sept. 2005
Lastpage :
32
Abstract :
Open source software systems are important components of many business software applications. Field defect predictions for open source software systems may allow organizations to make informed decisions regarding open source software components. In this paper, we remotely measure and analyze predictors (metrics available before release) mined from established data sources (the code repository and the request tracking system) as well as a novel source of data (mailing list archives) for nine releases of OpenBSD. First, we attempt to predict field defects by extending a software reliability model fitted to development defects. We find this approach to be infeasible, which motivates examining metrics-based field defect prediction. Then, we evaluate 139 predictors using established statistical methods: Kendall´s rank correlation, Pearson´s rank correlation, and forward AIC model selection. The metrics we collect include product metrics, development metrics, deployment and usage metrics, and software and hardware configurations metrics. We find the number of messages to the technical discussion mailing list during the development period (a deployment and usage metric captured from mailing list archives) to be the best predictor of field defects. Our work identifies predictors of field defects in commonly available data sources for open source software systems and is a step towards metrics-based field defect prediction for quantitatively-based decision making regarding open source software components
Keywords :
configuration management; data mining; public domain software; software metrics; software quality; software reliability; statistical analysis; Kendall rank correlation; OpenBSD; Pearson rank correlation; business software applications; code repository; data mining; data sources; deployment metrics; development metrics; field defect prediction; forward AIC model selection; hardware configuration metrics; mailing list archives; open source software components; open source software systems; product metrics; request tracking system; software metrics; software reliability; statistical methods; usage metrics; Application software; Computer aided software engineering; Computer science; Decision making; Open source software; Operating systems; Permission; Predictive models; Software engineering; Software quality; CVS repository; Documentation; Experimentation; Field defect prediction; Measurement; Process metrics; Product metrics; Reliability; Software quality assurance; Software science; deployment and usage metrics; mailing list archives; open source software; reliability modeling; request tracking system; software and hardware configurations metrics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Metrics, 2005. 11th IEEE International Symposium
Conference_Location :
Como
ISSN :
1530-1435
Print_ISBN :
0-7695-2371-4
Type :
conf
DOI :
10.1109/METRICS.2005.26
Filename :
1509310
Link To Document :
بازگشت