Title :
Comparing Mining Algorithms for Predicting the Severity of a Reported Bug
Author :
Lamkanfi, Ahmed ; Demeyer, Serge ; Soetens, Quinten David ; Verdonck, T.
Author_Institution :
LORE-Lab. On Reengineering, Univ. of Antwerp, Antwerp, Belgium
Abstract :
A critical item of a bug report is the so-called "severity", i.e. the impact the bug has on the successful execution of the software system. Consequently, tool support for the person reporting the bug in the form of a recommender or verification system is desirable. In previous work we made a first step towards such a tool: we demonstrated that text mining can predict the severity of a given bug report with a reasonable accuracy given a training set of sufficient size. In this paper we report on a follow-up study where we compare four well-known text mining algorithms (namely, Naive Bayes, Naive Bayes Multinomial, K-Nearest Neighbor and Support Vector Machines) with respect to accuracy and training set size. We discovered that for the cases under investigation (two open source systems: Eclipse and GNOME) Naive Bayes Multinomial performs superior compared to the other proposed algorithms.
Keywords :
data mining; program debugging; program verification; text analysis; Eclipse; GNOME; K-nearest neighbor; bug report; mining algorithm; naive Bayes multinomial; software system; support vector machines; text mining; verification system; Accuracy; Computer bugs; Prediction algorithms; Software; Text mining; Training; Bug Reports; Bug Severity; Bugzilla; Naive Bayes; Text Mining;
Conference_Titel :
Software Maintenance and Reengineering (CSMR), 2011 15th European Conference on
Conference_Location :
Oldenburg
Print_ISBN :
978-1-61284-259-2
DOI :
10.1109/CSMR.2011.31