• DocumentCode
    2914279
  • Title

    Treating missing data processing based on neural network and AdaBoost

  • Author

    Zhimin, Miao ; Zhisong, Pan ; Guyu, Hu ; Luwen, Zhao

  • Author_Institution
    ICA PLA Univ. of Sci. & Technol., Nanjing
  • fYear
    2007
  • fDate
    18-20 Nov. 2007
  • Firstpage
    1107
  • Lastpage
    1111
  • Abstract
    Missing data is a common problem in data quality. Such data are generally ignored or simply substituted in classification problem, which will affect the performance of a classifier. In the paper an innovative framework RBP-AdaBoost for handling with missing features values in classification is presented. This framework is composed of two parts: predicting the missing values and classifying the data including predicted missing values. Back-propagation algorithm (BP) is adopted to predict missing value firstly, and Adaptive Boosting (AdaBoost) as a methodology of aggregation of many weak classifiers into one strong classifier is used in classifying predicted missing data. We carry out experiments with nine UCI datasets to evaluate the effect on classification error rate of four general methods and the prediction model of BP. Experimental results show that the classification rate of the proposed new framework RBP-AdaBoost is increased 6.4% to 23.69% comparing with other methods. The performance of missing data treatment model is considered to be effective.
  • Keywords
    backpropagation; neural nets; pattern classification; AdaBoost; back-propagation algorithm; classification problem; data quality; missing data processing; neural network; Boosting; Classification tree analysis; Data mining; Data processing; Error analysis; Intelligent networks; Intelligent systems; Neural networks; Parameter estimation; Predictive models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Grey Systems and Intelligent Services, 2007. GSIS 2007. IEEE International Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-1294-5
  • Electronic_ISBN
    978-1-4244-1294-5
  • Type

    conf

  • DOI
    10.1109/GSIS.2007.4443444
  • Filename
    4443444