DocumentCode :
2918779
Title :
Software quality modeling: The impact of class noise on the random forest classifier
Author :
Folleco, Andres ; Khoshgoftaar, Taghi M. ; Van Hulse, Jason ; Bullard, Lofton
Author_Institution :
Dept. of Comput. Sci. & Eng., Florida Atlantic Univ., Boca Raton, FL
fYear :
2008
fDate :
1-6 June 2008
Firstpage :
3853
Lastpage :
3859
Abstract :
This study investigates the impact of increasing levels of simulated class noise on software quality classification. Class noise was injected into seven software engineering measurement datasets, and the performance of three learners, random forests, C4.5, and Naive Bayes, was analyzed. The random forest classifier was utilized for this study because of its strong performance relative to well-known and commonly-used classifiers such as C4.5 and Naive Bayes. Further, relatively little prior research in software quality classification has considered the random forest classifier. The experimental factors considered in this study were the level of class noise and the percent of minority instances injected with noise. The empirical results demonstrate that the random forest obtained the best and most consistent classification performance in all experiments.
Keywords :
Bayes methods; pattern classification; software metrics; software quality; C4.5; class noise; naive Bayes; random forest classifier; software quality classification; software quality modeling; Classification algorithms; Machine learning; Noise level; Noise measurement; Noise robustness; Radio frequency; Software algorithms; Software engineering; Software measurement; Software quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2008. CEC 2008. (IEEE World Congress on Computational Intelligence). IEEE Congress on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1822-0
Electronic_ISBN :
978-1-4244-1823-7
Type :
conf
DOI :
10.1109/CEC.2008.4631321
Filename :
4631321
Link To Document :
بازگشت