• DocumentCode
    2845899
  • Title

    A Comparison Study of Missing Value Processing Methods in Time Series Data Mining

  • Author

    Jiang, Yi ; Lan, Tuo ; Wu, LiHua

  • Author_Institution
    Dept. of Comput. Sci., Xiamen Univ., Xiamen, China
  • fYear
    2009
  • fDate
    11-13 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    There are many methods for dealing with missing value on time series mining. The regression model is better than other methods when the variables of the data are correlative. This paper uses the method of mean interpolation and one variable linear regression, multivariate linear regression and iterative regression method of regression interpolation to deal with missing value of hydrological time series database and compares them under different Pearson correlation coefficient. The study shows that the one variable linear regression is simple and intuitive, and has a higher accuracy when data gaps exist variables that have a great relevance with missing variable values, and it shows that multivariate linear regression and multiple regression iteration have better results when the data doesn ´t.
  • Keywords
    data mining; iterative methods; regression analysis; time series; Pearson correlation coefficient; data mining; iterative regression; missing value processing; multivariate linear regression; regression model; time series; variable linear regression; Clustering algorithms; Computer science; Data mining; Databases; Decision trees; Finance; Interpolation; Iterative methods; Linear regression; Multivariate regression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4507-3
  • Electronic_ISBN
    978-1-4244-4507-3
  • Type

    conf

  • DOI
    10.1109/CISE.2009.5365076
  • Filename
    5365076