DocumentCode :
683947
Title :
An optimized ETL fault-tolerant algorithm in data warehouses
Author :
Tu, Shitao ; Zhu, Lanjuan
Author_Institution :
Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240, China
fYear :
2013
fDate :
23-25 March 2013
Firstpage :
484
Lastpage :
487
Abstract :
Extraction-Transformation-Loading (ETL) plays an important role in data warehouse. Typically, performance is considered the main factor in ETL projects. Actually, faulttolerance and many other aspects influence the results of ETL greatly especially when the time period of projects are long and transformation rules cannot be determined from beginning, such as the situation of changing business logic. To satisfy the fault-tolerance and data validation in such kinds of situation, in this paper, we introduce a fault-tolerant algorithm which gives Redo strategy for different process interrupt scenarios. Moreover, we present a compound refresh mode consisting of full and incremental refresh to guarantee data correctness in changing business logic as well as timely data migration.
Keywords :
Business; Compounds; Data warehouses; Databases; Engines; Fault tolerance; Fault tolerant systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Technology (ICIST), 2013 International Conference on
Conference_Location :
Yangzhou
Print_ISBN :
978-1-4673-5137-9
Type :
conf
DOI :
10.1109/ICIST.2013.6747594
Filename :
6747594
Link To Document :
بازگشت