DocumentCode
2657392
Title
Improving tree-based models of software quality with principal components analysis
Author
Khoshgoftaar, Taghi M. ; Shan, Ruqun M. ; Allen, Edward B.
Author_Institution
Florida Atlantic Univ., Boca Raton, FL, USA
fYear
2000
fDate
2000
Firstpage
198
Lastpage
209
Abstract
Software quality classification models can predict which modules are to be considered fault-prone, and which are not, based on software product metrics, process metrics and execution metrics. Such predictions can be used to target improvement efforts to those modules that need them the most. Classification-tree modeling is a robust technique for building such software quality models. However, the model structure may be unstable, and accuracy may suffer when the predictors are highly correlated. This paper presents an empirical case study of four releases of a very large telecommunications system, which shows that the tree-based models can be improved by transforming the predictors with principal components analysis, so that the transformed predictors are not correlated. The case study used the regression-tree algorithm in the S-Plus package and then applied a general decision rule to classify the modules
Keywords
pattern classification; principal component analysis; software metrics; software packages; software quality; subroutines; telecommunication computing; trees (mathematics); S-Plus package; accuracy; case study; classification-tree modeling; correlated predictors; decision rule; fault-prone modules; improvement efforts; principal components analysis; program module classification; regression-tree algorithm; software execution metrics; software process metrics; software product metrics; software quality classification models; telecommunications system; transformed predictors; tree-based models; unstable model structure; Accuracy; Buildings; Classification tree analysis; Logistics; Packaging; Predictive models; Principal component analysis; Robustness; Software metrics; Software quality;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Reliability Engineering, 2000. ISSRE 2000. Proceedings. 11th International Symposium on
Conference_Location
San Jose, CA
ISSN
1071-9458
Print_ISBN
0-7695-0807-3
Type
conf
DOI
10.1109/ISSRE.2000.885872
Filename
885872
Link To Document