• DocumentCode
    560919
  • Title

    Enriching time series datasets using Nonparametric kernel regression to improve forecasting accuracy

  • Author

    Widodo, Agus ; Fanani, Mohamad Ivan ; Budi, Indra

  • Author_Institution
    Inf. Retrieval Lab., Univ. of Indonesia, Depok, Indonesia
  • fYear
    2011
  • fDate
    17-18 Dec. 2011
  • Firstpage
    227
  • Lastpage
    232
  • Abstract
    Improving the accuracy of prediction on future values based on the past and current observations has been pursued by enhancing the prediction´s methods, combining those methods or performing data pre-processing. In this paper, another approach is taken, namely by increasing the number of input in the dataset. This approach would be useful especially for a shorter time series data. By filling the in-between values in the time series, the number of training set can be increased, thus increasing the generalization capability of the predictor. The algorithm used to make prediction is Neural Network as it is widely used in literature for time series tasks. For comparison, Support Vector Regression is also employed. The dataset used in the experiment is the frequency of USPTO´s patents and PubMed´s scientific publications on the field of health, namely on Apnea, Arrhythmia, and Sleep Stages. Another time series data designated for NN3 Competition in the field of transportation is also used for benchmarking. The experimental result shows that the prediction performance can be significantly increased by filling in-between data in the time series. Furthermore, the use of detrend and deseasonalization which separates the data into trend, seasonal and stationary time series also improve the prediction performance both on original and filled dataset. The optimal number of increase on the dataset in this experiment is about five times of the length of original dataset.
  • Keywords
    forecasting theory; neural nets; regression analysis; time series; Apnea; Arrhythmia; Sleep Stage; forecasting accuracy; neural network; nonparametric kernel regression; support vector regression; time series dataset; Accuracy; Artificial neural networks; Kernel; Noise; Predictive models; Time series analysis; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Science and Information System (ICACSIS), 2011 International Conference on
  • Conference_Location
    Jakarta
  • Print_ISBN
    978-1-4577-1688-1
  • Type

    conf

  • Filename
    6140751