DocumentCode
2848074
Title
Optimizing ETL processes in data warehouses
Author
Simitsis, Alkis ; Vassiliadis, Panos ; Sellis, Timos
Author_Institution
Nat. Tech. Univ. of Athens, Greece
fYear
2005
fDate
5-8 April 2005
Firstpage
564
Lastpage
575
Abstract
Extraction-transformation-loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. Usually, these processes must be completed in a certain time window; thus, it is necessary to optimize their execution time. In this paper, we delve into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide algorithms towards the minimization of the execution cost of an ETL workflow.
Keywords
data warehouses; minimisation; search problems; ETL workflow; data warehouse; extraction-transformation-loading tools; logical optimization; minimization; state transition; state-space search problem; Costs; Data mining; Data warehouses; Databases; Design optimization; Minimization methods; Network address translation; Search problems; Software tools; State-space methods
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering, 2005. ICDE 2005. Proceedings. 21st International Conference on
ISSN
1084-4627
Print_ISBN
0-7695-2285-8
Type
conf
DOI
10.1109/ICDE.2005.103
Filename
1410172