DocumentCode :
2923919
Title :
A Hybrid Approach to Cleansing Software Measurement Data
Author :
Khoshgoftaar, Taghi M. ; Van Hulse, Jason ; Seiffert, Chris
Author_Institution :
Florida Atlantic Univ., Boca Raton, FL
fYear :
2006
fDate :
Nov. 2006
Firstpage :
713
Lastpage :
722
Abstract :
Data is extremely important in empirical software engineering. Techniques that provide insight into potential anomalies or inaccuracies in a dataset are becoming an increasingly important way for a data analyst to cope with flawed data. We present a novel hybrid procedure for quantitative outcome correction along with controlled experiments using a real-world software measurement dataset to demonstrate the usefulness of our technique. Instances that are deemed to be noisy relative to the dependent variable, which represents the number of faults recorded in the program module, are cleansed by replacing the original value with a more appropriate alternative value
Keywords :
data integrity; software quality; data analyst; dataset anomalies; dataset inaccuracies; empirical software engineering; flawed data; program module; quantitative outcome correction; real-world software measurement dataset; software measurement data cleansing; Application software; Data analysis; Data mining; Databases; Information systems; Machine learning; Mathematical model; Software engineering; Software measurement; Software quality;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
Conference_Location :
Arlington, VA
ISSN :
1082-3409
Print_ISBN :
0-7695-2728-0
Type :
conf
DOI :
10.1109/ICTAI.2006.11
Filename :
4031964
Link To Document :
بازگشت