Title :
An application of a rule-based model in software quality classification
Author :
Bullard, L.A. ; Khoshgoftaar, T.M. ; Kehan Gao
Author_Institution :
Florida Atlantic Univ., Boca Raton
Abstract :
A new rule-based classification model (RBCM) and rule-based model selection technique are presented. The RBCM utilizes rough set theory to significantly reduce the number of attributes, discretization to partition the domain of attribute values, and Boolean predicates to generate the decision rules that comprise the model. When the domain values of an attribute are continuous and relatively large, rough set theory requires that they be discretized. The subsequent discretized domain must have the same characteristics as the original domain values. However, this can lead to a large number of partitions of the attribute's domain space, which in turn leads to large rule sets. These rule sets tend to form models that over-fit. To address this issue, the proposed rule-based model adopts a new model selection strategy that minimizes over-fitting for the RBCM. Empirical validation of the RBCM is accomplished through a case study on a large legacy telecommunications system. The results demonstrate that the proposed RBCM and the model selection strategy are effective in identifying the classification model that minimizes over-fitting and high-cost classification errors.
Keywords :
classification; knowledge based systems; rough set theory; software quality; Boolean predicates; rule-based classification model; rule-based model selection technique; software quality classification; Application software; Classification tree analysis; Costs; Fault diagnosis; Machine learning; Phase measurement; Set theory; Software measurement; Software metrics; Software quality
Conference_Title :
Machine Learning and Applications, 2007. ICMLA 2007. Sixth International Conference on
Conference_Location :
Cincinnati, OH
Print_ISBN :
978-0-7695-3069-7
DOI :
10.1109/ICMLA.2007.69