• DocumentCode
    243551
  • Title

    Examination of Reliability of Missing Value Recovery in Data Mining

  • Author

    Shigang Liu ; Honghua Dai

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Melbourne, VIC, Australia
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    306
  • Lastpage
    313
  • Abstract
    Missing data imputation is an important task in cases where it is crucial to use all available data and no discard records with missing values. However, most of the existing algorithms are focused on missing at random (MAR) or missing completely at random (MCAR). In this paper, an information decomposition imputation (IDIM) algorithm using fuzzy membership function is proposed for addressing the missing value problem which is not missing at random (NMAR). The reliability of missing value recovery at not missing at random is examined. Firstly, this paper will discuss the proposed IDIM algorithm with detailed examples. Then, the reliability of the proposed approach is evaluated with extensive experiments compared with some typical algorithms, and the results demonstrate that the proposed algorithm has a higher accuracy rate than the exiting imputation methods in terms of normal root mean square error (NRMSE) and predictive accuracy at different set of data with missing values, which shows our method is more reliable in imputing missing values.
  • Keywords
    data mining; fuzzy set theory; mean square error methods; IDIM; MAR; MCAR; NMAR; NRMSE; data mining; fuzzy membership function; information decomposition imputation algorithm; missing at random; missing completely at random; missing data imputation; missing value recovery reliability; normal root mean square error; not missing at random; Accuracy; Algorithm design and analysis; Data mining; Educational institutions; Machine learning algorithms; Prediction algorithms; Reliability; Missing data imputation; information decomposition; not missing at random;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.84
  • Filename
    7022612