• DocumentCode
    243664
  • Title

    Imputation of Missing Values in Time Series with Lagged Correlations

  • Author

    Rahman, Shah Atiqur ; Yuxiao Huang ; Claassen, Jan ; Kleinberg, Samantha

  • Author_Institution
    Stevens Inst. of Technol., Hoboken, NJ, USA
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    753
  • Lastpage
    762
  • Abstract
    Missing values are a common problem in real world data and are particularly prevalent in biomedical time series, where a patient´s medical record may be split across multiple institutions or a device may briefly fail. These data are not missing completely at random, so ignoring the missing values can lead to bias and error during data mining. However, current methods for imputing missing values have yet to account for the fact that variables are correlated and that those relationships exist across time. To address this, we propose an imputation method (FLk-NN) that incorporates time lagged correlations both within and across variables by combining two imputation methods, based on an extension to k-NN and the Fourier transform. This enables imputation of missing values even when all data at a time point is missing and when there are different types of missingness both within and across variables. In comparison to other approaches on two biological datasets (simulated glucose in Type 1 diabetes and multi-modality neurological ICU monitoring) the proposed method has the highest imputation accuracy. This was true for up to half the data being missing and when consecutive missing values are a significant fraction of the overall time series length.
  • Keywords
    Fourier transforms; data mining; electronic health records; time series; FLk-NN; Fourier transform; biomedical time series; data mining; missing value imputation; patient medical record; time lagged correlations; Correlation; Fourier transforms; Insulin; Sugar; Time series analysis; Training; Vectors; Fourier imputation; correlated data with time-lag; extended k-NN imputation; missing data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.110
  • Filename
    7022671