• Title of article

    Missing data imputation using supervised learning methods

  • Author/Authors

    Rezaei Shiri, Behzad School of Mathematics - Statistics and Computer Science - College of Science - University of Tehran, Tehran, Iran , Eftekhari Mahabadi, Samaneh School of Mathematics - Statistics and Computer Science - College of Science - University of Tehran, Tehran, Iran

  • Pages
    10
  • From page
    103
  • To page
    112
  • Abstract
    Missing data is a very common problem in all research fields. Case deletion is a simple way to handle incomplete data sets which could mislead to biased statistical results. A more reliable approach to handle missing values is imputation which allows covariate-dependent missing mechanism, as well. This paper aims to prepare guidance for researchers facing missing data problems by comparing various imputation methods including machine learning techniques, to achieve better results in supervised learning tasks. A benchmark dataset has experimented and the results are compared by applying popular classifiers over varying missing mechanisms and rates on this benchmark dataset.
  • Keywords
    Imputation , Machine learning algorithms , Missing data , Missing mechanism
  • Journal title
    Journal of Statistical Modelling: Theory and Applications (JSMTA)
  • Serial Year
    2021
  • Record number

    2714896