DocumentCode :
397647
Title :
A generic neural network approach for filling missing data in data mining
Author :
Wei, Wei ; Tang, Ying
Author_Institution :
Dept. of Comput. Sci., New Jersey Inst. of Technol., Newark, NJ, USA
Volume :
1
fYear :
2003
fDate :
5-8 Oct. 2003
Firstpage :
862
Abstract :
The recent advances in data mining have produced algorithms for extracting hidden and potentially useful knowledge in large data sets, which are assumed to be complete and reliable. However, data suitable for mining comes from various sources, has different formats, and can have missing or incorrect values. Incomplete data sets significantly distort mining results. Therefore, data preparation to taking care of missing or out-of-range values is very critical to knowledge discovery. This paper proposes a generic framework for missing data imputation using neural networks, where two-stage filling algorithms are implemented. An empirical evaluation of this method through a large credit card data set is performed.
Keywords :
credit transactions; data mining; data preparation; neural nets; very large databases; credit card data set; data mining; data preparation; filling algorithms; knowledge discovery; knowledge extraction; large data sets; missing data; neural network; Computer science; Data analysis; Data engineering; Data mining; Delta modulation; Filling; Intelligent networks; Knowledge engineering; Neural networks; Reliability engineering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2003. IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-7952-7
Type :
conf
DOI :
10.1109/ICSMC.2003.1243923
Filename :
1243923
Link To Document :
بازگشت