DocumentCode :
2069906
Title :
Comparison of missing data imputation methods for traffic flow
Author :
Chang, Gang ; Ge, Tongmin
Author_Institution :
Dept. of Mil. Transp., Mil. Transp. Univ., Tianjin, China
fYear :
2011
fDate :
16-18 Dec. 2011
Firstpage :
639
Lastpage :
642
Abstract :
The complete and reliable field traffic data are vital for the planning, design and operation of urban traffic management systems. However, the problem of traffic data missing widely exists in many traffic information systems, which brings great troubles to the further utilization. Some approaches for imputing missing traffic data are needed, therefore, to minimize the effect of incomplete data on utilization. There are also problems of value missing in microarray data and several methods to estimate the missing values are proposed. These methods don´t exploit any biology knowledge to estimate the missing value and they are just methods for data mining which can also be applied to the imputation of traffic flow data. In the paper, ten imputation methods for handling missing value problem of microarray data are compared with Bayesian Principle Component Analysis (BPCA) imputation method which is convinced to outperform many conventional approaches. Experiment analysis shows that LSI_gene, LSI_array, LSI_combined, LSI_adaptive, EM_gene and local least square imputation methods outperform BPCA and be good choices to deal with the problem of missing traffic data imputation.
Keywords :
data mining; least squares approximations; minimisation; road traffic; traffic information systems; EM_gene imputation method; LSI_adaptive imputation method; LSI_array imputation method; LSI_combined imputation method; LSI_gene imputation method; data mining; field traffic data; incomplete data effect minimization; local least square imputation method; microarray data; missing data imputation methods; missing value estimation method; traffic flow data; traffic information systems; urban traffic management system design; urban traffic management system planning; Bayesian methods; Detectors; Distributed databases; Educational institutions; Mean square error methods; Testing; Transportation; Bayesian Principle Component Analysis (BPCA); Comparison; Expectation Maximization (EM); Least Square imputation(LSI); Local Least Squares imputation(LLSI); Missing value Imputation; Row Mean; Traffic flow; k-nearest neighbors(KNN);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Transportation, Mechanical, and Electrical Engineering (TMEE), 2011 International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4577-1700-0
Type :
conf
DOI :
10.1109/TMEE.2011.6199284
Filename :
6199284
Link To Document :
بازگشت