DocumentCode
2353533
Title
Performance Analysis of Datamining Algorithms for Software Quality Prediction
Author
Gayatri, N. ; Nickolas, S. ; Reddy, A.V. ; Chitra, R.
Author_Institution
Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
fYear
2009
fDate
27-28 Oct. 2009
Firstpage
393
Lastpage
395
Abstract
Data mining techniques are applied in building software fault prediction models for improving the software quality. Early identification of high-risk modules can assist in quality enhancement efforts to modules that are likely to have a high number of faults. Classification tree models are simple and effective as software quality prediction models, and timely predictions of defects from such models can be used to achieve high software reliability. In this paper, the performance of five data mining classifier algorithms named J48, CART, Random Forest, BFTree and Naive Bayesian classifier (NBC) are evaluated based on 10 fold cross validation test. Experimental results using KC2 NASA software metrics dataset demonstrates that decision trees are much useful for fault predictions and based on rules generated only some measurement attributes in the given set of the metrics play an important role in establishing final rules and for improving the software quality by giving correct predictions. Thus we can suggest that these attributes are sufficient for future classification process. To evaluate the performance of the above algorithms Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Receiver Operating Characteristic (ROC) and Accuracy measures are applied.
Keywords
data mining; pattern classification; software performance evaluation; software quality; software reliability; BFTree; CART; J48; accuracy measure; classification tree model; data mining; mean absolute error; naive Bayesian classifier; performance analysis; quality enhancement; random forest; receiver operating characteristic; root mean squared error; software fault prediction model; software quality prediction; software reliability; Bayesian methods; Classification tree analysis; Data mining; Fault diagnosis; Niobium compounds; Performance analysis; Predictive models; Software algorithms; Software quality; Software reliability; BFTree; CART; Classification; CrossValidation; J48; Naive Bayesian; Random forest; Software Quality;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Recent Technologies in Communication and Computing, 2009. ARTCom '09. International Conference on
Conference_Location
Kottayam, Kerala
Print_ISBN
978-1-4244-5104-3
Electronic_ISBN
978-0-7695-3845-7
Type
conf
DOI
10.1109/ARTCom.2009.12
Filename
5329377
Link To Document