Title :
Missing values imputation hypothesis: An experimental evaluation
Author :
Li, Huaxiong ; Zhou, Xianzhong ; Yao, Yiyu
Author_Institution :
Sch. of Manage. & Eng., Nanjing Univ., Nanjing, China
Abstract :
Missing values imputation is a basic strategy to deal with incomplete data. Many developed methods treat filled-in values as if they are original data. The correctness of such hypothesis has not been widely studied. In this paper, a philosophical and experimental study on the hypothesis of missing values imputation is discussed. In the experiments, classification accuracy of three learning algorithms with regard to six incomplete data sets are compared, which indicates that missing values imputation may not always help to improve the learning performance. Learning directly from incomplete data without imputation may reach a satisfying performance. The study not only provides an experimental analysis on missing values imputation, but also presents a new view on rule induction from incomplete data, which is much different from previous standpoint.
Keywords :
data mining; learning (artificial intelligence); incomplete data; learning algorithms; missing values imputation hypothesis; rule induction; Computer science; Data engineering; Data mining; Engineering management; Laboratories; Machine learning; Technology management; imputation; incomplete data; missing values; rule induction;
Conference_Titel :
Cognitive Informatics, 2009. ICCI '09. 8th IEEE International Conference on
Conference_Location :
Kowloon, Hong Kong
Print_ISBN :
978-1-4244-4642-1
DOI :
10.1109/COGINF.2009.5250727