DocumentCode :
2208664
Title :
Software change classification using hunk metrics
Author :
Ferzund, Javed ; Ahsan, Syed Nadeem ; Wotawa, Franz
Author_Institution :
Inst. for Software Technol., Graz Univ. of Technol., Graz, Austria
fYear :
2009
fDate :
20-26 Sept. 2009
Firstpage :
471
Lastpage :
474
Abstract :
Change management is a challenging task in software maintenance. Changes are made to the software during its whole life. Some of these changes introduce errors in the code which result in failures. Software changes are composed of small code units called hunks, dispersed in source code files. In this paper we present a technique for classifying software changes based on hunk metrics. We classify individual hunks as buggy or bug-free, thus we provide an approach for bug prediction at the smallest level of granularity. We introduce a set of hunk metrics and build classification models based on these metrics. Classification models are built using logistic regression and random forests. We evaluated the performance of our approach on 7 open source software projects. Our classification approach can classify hunks as buggy or bug free with 81 percent accuracy, 77 percent buggy hunk precision and 67 percent buggy hunk recall on average. Most of the hunk metrics are significant predictors of bugs but the set of significant metrics varies among different projects.
Keywords :
management of change; regression analysis; software maintenance; software metrics; source coding; change management; hunk metrics; logistic regression; random forests; software change classification; software maintenance; source code files; Computer bugs; Data mining; Feature extraction; Logistics; Machine learning; Open source software; Predictive models; Software maintenance; Technology management;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Maintenance, 2009. ICSM 2009. IEEE International Conference on
Conference_Location :
Edmonton, AB
ISSN :
1063-6773
Print_ISBN :
978-1-4244-4897-5
Electronic_ISBN :
1063-6773
Type :
conf
DOI :
10.1109/ICSM.2009.5306274
Filename :
5306274
Link To Document :
بازگشت