• DocumentCode
    610375
  • Title

    Forecasting the data cube: A model configuration advisor for multi-dimensional data sets

  • Author

    Fischer, Ulrich ; Schildt, C. ; Hartmann, C. ; Lehner, Wolfgang

  • Author_Institution
    Database Technol. Group, Dresden Univ. of Technol., Dresden, Germany
  • fYear
    2013
  • fDate
    8-12 April 2013
  • Firstpage
    853
  • Lastpage
    864
  • Abstract
    Forecasting time series data is crucial in a number of domains such as supply chain management and display advertisement. In these areas, the time series data to forecast is typically organized along multiple dimensions leading to a high number of time series that need to be forecasted. Most current approaches focus only on selection and optimizing a forecast model for a single time series. In this paper, we explore how we can utilize time series at different dimensions to increase forecast accuracy and, optionally, reduce model maintenance overhead. Solving this problem is challenging due to the large space of possibilities and possible high model creation costs. We propose a model configuration advisor that automatically determines the best set of models, a model configuration, for a given multi-dimensional data set. Our approach is based on a general process that iteratively examines more and more models and simultaneously controls the search space depending on the data set, model type and available hardware. The final model configuration is integrated into F2DB, an extension of PostgreSQL, that processes forecast queries and maintains the configuration as new data arrives. We comprehensively evaluated our approach on real and synthetic data sets. The evaluation shows that our approach significantly increases forecast query accuracy while ensuring low model costs.
  • Keywords
    SQL; data handling; query processing; relational databases; time series; PostgreSQL; Structured Query Language; data cube forecasting; display advertisement; forecast accuracy; model configuration advisor; multidimensional data set; query accuracy; supply chain management; time series data forecasting; time series forecast model; Accuracy; Cities and towns; Data models; Forecasting; Numerical models; Predictive models; Time series analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2013 IEEE 29th International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1063-6382
  • Print_ISBN
    978-1-4673-4909-3
  • Electronic_ISBN
    1063-6382
  • Type

    conf

  • DOI
    10.1109/ICDE.2013.6544880
  • Filename
    6544880