DocumentCode :
243551
Title :
Examination of Reliability of Missing Value Recovery in Data Mining
Author :
Shigang Liu ; Honghua Dai
Author_Institution :
Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
306
Lastpage :
313
Abstract :
Missing data imputation is an important task in cases where it is crucial to use all available data and no discard records with missing values. However, most of the existing algorithms are focused on missing at random (MAR) or missing completely at random (MCAR). In this paper, an information decomposition imputation (IDIM) algorithm using fuzzy membership function is proposed for addressing the missing value problem which is not missing at random (NMAR). The reliability of missing value recovery at not missing at random is examined. Firstly, this paper will discuss the proposed IDIM algorithm with detailed examples. Then, the reliability of the proposed approach is evaluated with extensive experiments compared with some typical algorithms, and the results demonstrate that the proposed algorithm has a higher accuracy rate than the exiting imputation methods in terms of normal root mean square error (NRMSE) and predictive accuracy at different set of data with missing values, which shows our method is more reliable in imputing missing values.
Keywords :
data mining; fuzzy set theory; mean square error methods; IDIM; MAR; MCAR; NMAR; NRMSE; data mining; fuzzy membership function; information decomposition imputation algorithm; missing at random; missing completely at random; missing data imputation; missing value recovery reliability; normal root mean square error; not missing at random; Accuracy; Algorithm design and analysis; Data mining; Educational institutions; Machine learning algorithms; Prediction algorithms; Reliability; Missing data imputation; information decomposition; not missing at random;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.84
Filename :
7022612
Link To Document :
بازگشت