DocumentCode
176169
Title
Combining Text Mining and Data Mining for Bug Report Classification
Author
Yu Zhou ; Yanxiang Tong ; Ruihang Gu ; Gall, H.
Author_Institution
Coll. of Comput. Sci., Nanjing Univ. of Aeronaut. & Astronaut., Nanjing, China
fYear
2014
fDate
Sept. 29 2014-Oct. 3 2014
Firstpage
311
Lastpage
320
Abstract
Misclassification of bug reports inevitably sacrifices the performance of bug prediction models. Manual examinations can help reduce the noise but bring a heavy burden for developers instead. In this paper, we propose a hybrid approach by combining both text mining and data mining techniques of bug report data to automate the prediction process. The first stage leverages text mining techniques to analyze the summary parts of bug reports and classifies them into three levels of probability. The extracted features and some other structured features of bug reports are then fed into the machine learner in the second stage. Data grafting techniques are employed to bridge the two stages. Comparative experiments with previous studies on the same data -- three large-scale open source projects -- consistently achieve a reasonable enhancement (from 77.4% to 81.7%, 73.9% to 80.2% and 87.4% to 93.7%, respectively) over their best results in terms of overall performance. Additional comparative empirical experiments on other two popular open source repositories confirm the findings and demonstrate the benefits of our approach.
Keywords
data mining; learning (artificial intelligence); program debugging; public domain software; text analysis; bug prediction models; bug report classification; data grafting techniques; data mining techniques; hybrid approach; large-scale open source projects; machine learner; open source repositories; summary parts; text mining techniques; Bayes methods; Feature extraction; Predictive models; Software; Text mining; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Maintenance and Evolution (ICSME), 2014 IEEE International Conference on
Conference_Location
Victoria, BC
ISSN
1063-6773
Type
conf
DOI
10.1109/ICSME.2014.53
Filename
6976097
Link To Document