Title of article
Missing data imputation using supervised learning methods
Author/Authors
Rezaei Shiri, Behzad School of Mathematics - Statistics and Computer Science - College of Science - University of Tehran, Tehran, Iran , Eftekhari Mahabadi, Samaneh School of Mathematics - Statistics and Computer Science - College of Science - University of Tehran, Tehran, Iran
Pages
10
From page
103
To page
112
Abstract
Missing data is a very common problem in all research fields. Case deletion
is a simple way to handle incomplete data sets which could mislead to biased statistical
results. A more reliable approach to handle missing values is imputation which allows
covariate-dependent missing mechanism, as well. This paper aims to prepare guidance
for researchers facing missing data problems by comparing various imputation methods
including machine learning techniques, to achieve better results in supervised learning
tasks. A benchmark dataset has experimented and the results are compared by applying
popular classifiers over varying missing mechanisms and rates on this benchmark
dataset.
Keywords
Imputation , Machine learning algorithms , Missing data , Missing mechanism
Journal title
Journal of Statistical Modelling: Theory and Applications (JSMTA)
Serial Year
2021
Record number
2714896
Link To Document