• DocumentCode
    2848074
  • Title

    Optimizing ETL processes in data warehouses

  • Author

    Simitsis, Alkis ; Vassiliadis, Panos ; Sellis, Timos

  • Author_Institution
    Nat. Tech. Univ. of Athens, Greece
  • fYear
    2005
  • fDate
    5-8 April 2005
  • Firstpage
    564
  • Lastpage
    575
  • Abstract
    Extraction-transformation-loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. Usually, these processes must be completed in a certain time window; thus, it is necessary to optimize their execution time. In this paper, we delve into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide algorithms towards the minimization of the execution cost of an ETL workflow.
  • Keywords
    data warehouses; minimisation; search problems; ETL workflow; data warehouse; extraction-transformation-loading tools; logical optimization; minimization; state transition; state-space search problem; Costs; Data mining; Data warehouses; Databases; Design optimization; Minimization methods; Network address translation; Search problems; Software tools; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
  • ISSN
    1084-4627
  • Print_ISBN
    0-7695-2285-8
  • Type

    conf

  • DOI
    10.1109/ICDE.2005.103
  • Filename
    1410172