DocumentCode :
2579439
Title :
Imputation for the analysis of missing values and prediction of time series data
Author :
Sridevi, S. ; Rajaram, S. ; Parthiban, C. ; SibiArasan, S. ; Swadhikar, C.
Author_Institution :
Dept. of CSE, Thiagarajar Coll. of Eng., Madurai, India
fYear :
2011
fDate :
3-5 June 2011
Firstpage :
1158
Lastpage :
1163
Abstract :
Data preprocessing plays a critical role in the data mining process and is required to improve the efficiency of an algorithm. This paper focuses on missing value estimation and on the prediction of time series data from historical values. A number of algorithms have been developed to solve this problem, but they have several limitations. Most existing algorithms, such as KNNimpute (K-Nearest Neighbours imputation), BPCA (Bayesian Principal Component Analysis) and SVDimpute (Singular Value Decomposition imputation), cannot handle the situation where a particular time point (column) of the data is missing entirely. This paper focuses on an autoregressive-model-based missing value estimation method (ARLSimpute), which is effective when a particular time point contains many missing values or when the entire time point is missing. The output of data preprocessing is given as input to two prediction techniques, namely linear prediction and quadratic prediction, which predict future values from the historical values. The performance of the algorithm is measured with metrics such as precision and recall. Experimental results on real-life datasets demonstrate that the proposed algorithm is effective and efficient at predicting future time series data.
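The idea of autoregressive imputation mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the ARLSimpute algorithm from the paper, whose details the abstract does not give; it is a generic least-squares AR fit on a single series, and the function name `impute_with_ar`, the `order` parameter, and the sample data are assumptions made only for illustration.

```python
import numpy as np

def impute_with_ar(series, order=2):
    """Fill NaN entries of a 1-D series using a least-squares AR(order) model.

    Hypothetical helper for illustration only; not the paper's ARLSimpute.
    """
    x = np.array(series, dtype=float)
    if np.isnan(x[:order]).any():
        raise ValueError("the first `order` values must be observed")

    # Each window holds `order` predictors followed by the value they predict.
    windows = np.array([x[t - order:t + 1] for t in range(order, len(x))])
    # Fit only on windows that contain no missing entries.
    complete = windows[~np.isnan(windows).any(axis=1)]
    coeffs, *_ = np.linalg.lstsq(complete[:, :order], complete[:, order], rcond=None)

    # Fill gaps left to right so earlier imputed values can feed later ones.
    for t in range(order, len(x)):
        if np.isnan(x[t]):
            x[t] = x[t - order:t] @ coeffs
    return x

# Example: a linear trend with one missing observation.
filled = impute_with_ar([1, 2, 3, 4, 5, 6, np.nan, 8, 9, 10], order=2)
print(filled)
```

Because each gap is filled from the preceding values of the same series, this kind of model does not need other observations at the missing time point, which is the situation the abstract says neighbour-based methods cannot handle.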
Keywords :
autoregressive moving average processes; data mining; estimation theory; ARLSimpute; autoregressive-model-based missing value estimation method; data mining process; data preprocessing; missing value analysis; time series data prediction; Algorithm design and analysis; Data mining; Databases; Estimation; Prediction algorithms; Predictive models; Time series analysis; Auto-Regressive (AR) model; Prediction; Temporal Databases; time series analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Recent Trends in Information Technology (ICRTIT), 2011 International Conference on
Conference_Location :
Chennai, Tamil Nadu
Print_ISBN :
978-1-4577-0588-5
Type :
conf
DOI :
10.1109/ICRTIT.2011.5972466
Filename :
5972466