DocumentCode
2845899
Title
A Comparison Study of Missing Value Processing Methods in Time Series Data Mining
Author
Jiang, Yi ; Lan, Tuo ; Wu, LiHua
Author_Institution
Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
fYear
2009
fDate
11-13 Dec. 2009
Firstpage
1
Lastpage
4
Abstract
There are many methods for dealing with missing value on time series mining. The regression model is better than other methods when the variables of the data are correlative. This paper uses the method of mean interpolation and one variable linear regression, multivariate linear regression and iterative regression method of regression interpolation to deal with missing value of hydrological time series database and compares them under different Pearson correlation coefficient. The study shows that the one variable linear regression is simple and intuitive, and has a higher accuracy when data gaps exist variables that have a great relevance with missing variable values, and it shows that multivariate linear regression and multiple regression iteration have better results when the data doesn ´t.
Keywords
data mining; iterative methods; regression analysis; time series; Pearson correlation coefficient; data mining; iterative regression; missing value processing; multivariate linear regression; regression model; time series; variable linear regression; Clustering algorithms; Computer science; Data mining; Databases; Decision trees; Finance; Interpolation; Iterative methods; Linear regression; Multivariate regression;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-4507-3
Electronic_ISBN
978-1-4244-4507-3
Type
conf
DOI
10.1109/CISE.2009.5365076
Filename
5365076
Link To Document