• DocumentCode
    2474642
  • Title

    Missing categorical data imputation approach based on similarity

  • Author

    Wu, Sen ; Feng, Xiaodong ; Han, Yushan ; Wang, Qiang

  • Author_Institution
    Dongling Sch. of Econ. & Manage., Univ. of Sci. & Technol. Beijing, Beijing, China
  • fYear
    2012
  • fDate
    14-17 Oct. 2012
  • Firstpage
    2827
  • Lastpage
    2832
  • Abstract
    Imputation for missing data is an important task of data mining, which may influence the data mining result. In this paper, Missing Categorical Data Imputation Based on Similarity (MIBOS) is proposed to solve this problem. The algorithm defines a similarity model between objects with incomplete data, constructing the similarity matrix of objects and further gets the nearest undifferentiated object sets of each object to impute the missing data iteratively. In the imputing process, the imputed value will be directly applied to the same iteration and the following iterations. Experiments with three UCI benchmark data sets show the improvement of the proposed algorithm from perspectives of complete rate, accuracy and time efficiency.
  • Keywords
    data mining; MIBOS; Missing Categorical Data Imputation Based on Similarity; UCI benchmark data sets; data mining; missing categorical data imputation approach; missing data; object similarity matrix; similarity model; Accuracy; Algorithm design and analysis; Data mining; Data models; Heart; Information systems; Single photon emission computed tomography; data mining; missing data imputation; rough set‥; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
  • Conference_Location
    Seoul
  • Print_ISBN
    978-1-4673-1713-9
  • Electronic_ISBN
    978-1-4673-1712-2
  • Type

    conf

  • DOI
    10.1109/ICSMC.2012.6378177
  • Filename
    6378177