DocumentCode :
3658666
Title :
FECS: A Cluster Based Feature Selection Method for Software Fault Prediction with Noises
Author :
Wangshu Liu;Shulong Liu;Qing Gu;Xiang Chen;Daoxu Chen
Author_Institution :
State Key Lab. for Novel Software Technol., Nanjing Univ., Nanjing, China
Volume :
2
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
276
Lastpage :
281
Abstract :
Noises are inevitable when mining software archives for software fault prediction. Although some researchers have investigated the noise tolerance of existing feature selection methods, few studies focus on proposing new feature selection methods with a certain noise tolerance. To solve this issue, we propose a novel method FECS (FEature Clustering with Selection strategies). This method includes two phases: a feature clustering phase and a feature selection phase with three different heuristic search strategies. During empirical studies, we choose real-world software projects, such as Eclipse and NASA and inject class level and feature level noises simultaneously to imitate noisy datasets. After using classical feature selection methods as the baseline, we confirm the effectiveness of FECS and provide a guideline of using FECS after analyzing the effects of varying either the percentage of selected features or the noise rate.
Keywords :
"Noise","Software","Noise measurement","Data mining","Loss measurement","Correlation"
Publisher :
ieee
Conference_Titel :
Computer Software and Applications Conference (COMPSAC), 2015 IEEE 39th Annual
Electronic_ISBN :
0730-3157
Type :
conf
DOI :
10.1109/COMPSAC.2015.66
Filename :
7273629
Link To Document :
بازگشت