DocumentCode
243551
Title
Examination of Reliability of Missing Value Recovery in Data Mining
Author
Shigang Liu ; Honghua Dai
Author_Institution
Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
306
Lastpage
313
Abstract
Missing data imputation is an important task in cases where it is crucial to use all available data and no discard records with missing values. However, most of the existing algorithms are focused on missing at random (MAR) or missing completely at random (MCAR). In this paper, an information decomposition imputation (IDIM) algorithm using fuzzy membership function is proposed for addressing the missing value problem which is not missing at random (NMAR). The reliability of missing value recovery at not missing at random is examined. Firstly, this paper will discuss the proposed IDIM algorithm with detailed examples. Then, the reliability of the proposed approach is evaluated with extensive experiments compared with some typical algorithms, and the results demonstrate that the proposed algorithm has a higher accuracy rate than the exiting imputation methods in terms of normal root mean square error (NRMSE) and predictive accuracy at different set of data with missing values, which shows our method is more reliable in imputing missing values.
Keywords
data mining; fuzzy set theory; mean square error methods; IDIM; MAR; MCAR; NMAR; NRMSE; data mining; fuzzy membership function; information decomposition imputation algorithm; missing at random; missing completely at random; missing data imputation; missing value recovery reliability; normal root mean square error; not missing at random; Accuracy; Algorithm design and analysis; Data mining; Educational institutions; Machine learning algorithms; Prediction algorithms; Reliability; Missing data imputation; information decomposition; not missing at random;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.84
Filename
7022612
Link To Document