Title :
Detecting Fault Modules Applying Feature Selection to Classifiers
Author :
Rodríguez, D. ; Ruiz, R. ; Cuadrado-Gallego, J. ; Aguilar-Ruiz, J.
Author_Institution :
Alcala Univ., Madrid
Abstract :
At present, automated data collection tools allow us to collect large amounts of information, not without associated problems. This paper, we apply feature selection to several software engineering databases selecting attributes with the final aim that project managers can have a better global vision of the data they manage. In this paper, we make use of attribute selection techniques in different datasets publicly available (PROMISE repository), and different data mining algorithms for classification to defect faulty modules. The results show that in general, smaller datasets with less attributes maintain or improve the prediction capability with less attributes than the original datasets.
Keywords :
data mining; feature extraction; learning (artificial intelligence); pattern classification; project management; software management; PROMISE repository; attribute selection techniques; automated data collection tools; classifier learning; data mining algorithms; fault module detection; feature selection; project management; software engineering databases; Application software; Computer science; Costs; Data mining; Engineering management; Fault detection; Filters; Project management; Software engineering; Spatial databases;
Conference_Titel :
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location :
Las Vegas, IL
Print_ISBN :
1-4244-1500-4
Electronic_ISBN :
1-4244-1500-4
DOI :
10.1109/IRI.2007.4296696