Title :
Predicting the priority of a reported bug using machine learning techniques and cross project validation
Author :
Sharma, Mukesh ; Bedi, Punam ; Chaturvedi, K.K. ; Singh, V.B.
Author_Institution :
Dept. of Comput. Sci., Univ. of Delhi, Delhi, India
Abstract :
In bug repositories, we receive a large number of bug reports on daily basis. Managing such a large repository is a challenging job. Priority of a bug tells that how important and urgent it is for us to fix. Priority of a bug can be classified into 5 levels from PI to P5 where PI is the highest and P5 is the lowest priority. Correct prioritization of bugs helps in bug fix scheduling/assignment and resource allocation. Failure of this will result in delay of resolving important bugs. This requires a bug prediction system which can predict the priority of a newly reported bug. Cross project validation is also an important concern in empirical software engineering where we train classifier on one project and test it for prediction on other projects. In the available literature, we found very few papers for bug priority prediction and none of them dealt with cross project validation. In this paper, we have evaluated the performance of different machine learning techniques namely Support Vector Machine (SVM), Naive Bayes (NB), K-Nearest Neighbors (KNN) and Neural Network (NNet) in predicting the priority of the newly coming reports on the basis of different performance measures. We performed cross project validation for 76 cases of five data sets of open office and eclipse projects. The accuracy of different machine learning techniques in predicting the priority of a reported bug within and across project is found above 70% except Naive Bayes technique.
Keywords :
belief networks; learning (artificial intelligence); neural nets; pattern classification; program debugging; program testing; project management; resource allocation; support vector machines; KNN; NB; NNet; Naive Bayes classifier; SVM; bug fix scheduling; bug prediction system; bug prioritization; bug priority classification; bug repositories; cross project validation; eclipse projects; empirical software engineering; k-nearest neighbor classifier; machine learning techniques; neural network; open office projects; reported bug priority prediction; resource allocation; support vector machine; Decision support systems; Hafnium compounds; Intelligent systems; 10-fold Cross Validation; Bug priority; Bug repositories; Classifiers; KNN; Naive Bayes; Neural Net; SVM; Triager;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-5117-1
DOI :
10.1109/ISDA.2012.6416595