Title :
Building a Novel GP-Based Software Quality Classifier Using Multiple Validation Datasets
Author :
Liu, Yi ; Khoshgoftaar, Taghi ; Yao, Jenq-Foung
Author_Institution :
Georgia Coll. & State Univ., Milledgeville
Abstract :
One problem associated with software quality classification (SQC) modeling is that the historical metric dataset obtained from a single software project is often not adequate to build robust and accurate models. To address this issue, recent research has used multiple datasets obtained from different software projects for SQC modeling. Our previous study demonstrated that using multiple datasets for validation can yield robust genetic programming (GP)-based SQC models. This paper further investigates the effectiveness of using multiple validation datasets. Moreover, a novel GP-based classifier consisting of training, multiple-dataset validation, and voting phases is proposed. The experiments are carried out on seven NASA software projects. The results are compared with those achieved by seventeen other data mining techniques. The comparisons demonstrate that our approach performs significantly better when it uses multiple datasets from different software projects with similar reliability goals.
Keywords :
genetic algorithms; pattern classification; software management; software metrics; software quality; NASA software project; data mining; multiple validation dataset; multiple-dataset validation; robust genetic programming; similar reliability goal; software quality classification modeling; software quality classifier; Data mining; Educational institutions; Genetic programming; NASA; Project management; Robustness; Software metrics; Software quality; System testing; Voting; genetic programming; model selection; multiple datasets; paired t-test; software metrics; software quality classification; validation;
Conference_Title :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
1-4244-1500-4
Electronic_ISBN :
1-4244-1500-4
DOI :
10.1109/IRI.2007.4296693