DocumentCode
2914279
Title
Treating missing data processing based on neural network and AdaBoost
Author
Zhimin, Miao ; Zhisong, Pan ; Guyu, Hu ; Luwen, Zhao
Author_Institution
ICA PLA Univ. of Sci. & Technol., Nanjing
fYear
2007
fDate
18-20 Nov. 2007
Firstpage
1107
Lastpage
1111
Abstract
Missing data is a common problem in data quality. Such data are generally ignored or simply substituted in classification problem, which will affect the performance of a classifier. In the paper an innovative framework RBP-AdaBoost for handling with missing features values in classification is presented. This framework is composed of two parts: predicting the missing values and classifying the data including predicted missing values. Back-propagation algorithm (BP) is adopted to predict missing value firstly, and Adaptive Boosting (AdaBoost) as a methodology of aggregation of many weak classifiers into one strong classifier is used in classifying predicted missing data. We carry out experiments with nine UCI datasets to evaluate the effect on classification error rate of four general methods and the prediction model of BP. Experimental results show that the classification rate of the proposed new framework RBP-AdaBoost is increased 6.4% to 23.69% comparing with other methods. The performance of missing data treatment model is considered to be effective.
Keywords
backpropagation; neural nets; pattern classification; AdaBoost; back-propagation algorithm; classification problem; data quality; missing data processing; neural network; Boosting; Classification tree analysis; Data mining; Data processing; Error analysis; Intelligent networks; Intelligent systems; Neural networks; Parameter estimation; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Grey Systems and Intelligent Services, 2007. GSIS 2007. IEEE International Conference on
Conference_Location
Nanjing
Print_ISBN
978-1-4244-1294-5
Electronic_ISBN
978-1-4244-1294-5
Type
conf
DOI
10.1109/GSIS.2007.4443444
Filename
4443444
Link To Document