DocumentCode :
1573087
Title :
Improvement of snapshot differential algorithm based on hadoop platform
Author :
Yuan, Guoyong ; Li, Bin ; Xiao, Taiyang
Author_Institution :
Dept. of Comput. Sci., Jinan Univ., Guangzhou, China
Volume :
2
fYear :
2011
Firstpage :
1212
Lastpage :
1214
Abstract :
Snapshot differential algorithm is one of ways of extracting delta from views in the data warehouse in data integration circumstance. Due to the scale of the views in data warehouse is likely to be very massive, it will take lots of time to run snapshot differential algorithm and become the bottleneck of the system performance. In this paper, in order to improve efficiency of Snapshot Differential Algorithm, by using the massive data processing platform, we modify traditional Partition Hash algorithm, improve the efficiency and reduce the calculating time. At the end of this paper, we show a test which will demonstrate the improvement of efficiency after modification.
Keywords :
data warehouses; data integration; data warehouse; hadoop platform; massive data processing platform; partition hash algorithm; snapshot differential algorithm; delta extraction; distributed computing; snapshot differential algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9792-8
Type :
conf
DOI :
10.1109/CSQRWC.2011.6037179
Filename :
6037179
Link To Document :
بازگشت