DocumentCode
3386323
Title
Attribute Selection in Software Engineering Datasets for Detecting Fault Modules
Author
Rodríguez, D. ; Ruiz, R. ; Cuadrado-Gallego, J. ; Aguilar-Ruiz, J. ; Garre, M.
Author_Institution
Univ. of Alcala Ctra., Madrid
fYear
2007
fDate
28-31 Aug. 2007
Firstpage
418
Lastpage
423
Abstract
Decision making has been traditionally based on managers experience. At present, there is a number of software engineering (SE) repositories, and furthermore, automated data collection tools allow managers to collect large amounts of information, not without associated problems. On the one hand, such a large amount of information can overload project managers. On the other hand, problems found in generic project databases, where the data is collected from different organizations, is the large disparity of its instances. In this paper, we characterize several software engineering databases selecting attributes with the final aim that project managers can have a better global vision of the data they manage. In this paper, we make use of different data mining algorithms to select attributes from the different datasets publicly available (PROMISE repository), and then, use different classifiers to defect faulty modules. The results show that in general, the smaller datasets maintain the prediction capability with a lower number of attributes than the original datasets.
Keywords
data mining; project management; software fault tolerance; data mining algorithm; decision making; fault module detection; project manager; software engineering database; Computer science; Data mining; Databases; Decision making; Engineering management; Fault detection; Programming; Project management; Software engineering; Software tools;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering and Advanced Applications, 2007. 33rd EUROMICRO Conference on
Conference_Location
Lubeck
ISSN
1089-6503
Print_ISBN
978-0-7695-2977-6
Type
conf
DOI
10.1109/EUROMICRO.2007.20
Filename
4301106
Link To Document