Title :
An Empirical Study on Improving Severity Prediction of Defect Reports Using Feature Selection
Author :
Cheng-Zen Yang ; Chun-Chi Hou ; Wei-Chen Kao ; Ing-Xiang Chen
Author_Institution :
Dept. of Comput. Sci. & Eng., Yuan Ze Univ., Chungli, Taiwan
Abstract :
In software maintenance, severity prediction on defect reports is an emerging issue obtaining research attention due to the considerable triaging cost. In the past research work, several text mining approaches have been proposed to predict the severity using advanced learning models. Although these approaches demonstrate the effectiveness of predicting the severity, they do not discuss the problem of how to find the indicators in good quality. In this paper, we discuss whether feature selection can benefit the severity prediction task with three commonly used feature selection schemes, Information Gain, Chi-Square, and Correlation Coefficient, based on the Multinomial Naive Bayes classification approach. We have conducted empirical experiments with four open-source components from Eclipse and Mozilla. The experimental results show that these three feature selection schemes can further improve the predication performance in over half the cases.
Keywords :
Bayes methods; data mining; learning (artificial intelligence); pattern classification; public domain software; software maintenance; Eclipse; Mozilla; advanced learning models; chi-square; correlation coefficient; defect reports; feature selection; information gain; multinomial naive Bayes classification approach; open-source components; severity prediction; software maintenance; text mining approaches; triaging cost; Feature extraction; Frequency measurement; Niobium; Predictive models; Software; Text mining; Training; Defect Reports; Feature Selection; Performance Evaluation; Severity Prediction;
Conference_Titel :
Software Engineering Conference (APSEC), 2012 19th Asia-Pacific
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4673-4930-7
DOI :
10.1109/APSEC.2012.144