Title :
Improvement of snapshot differential algorithm based on hadoop platform
Author :
Yuan, Guoyong ; Li, Bin ; Xiao, Taiyang
Author_Institution :
Dept. of Comput. Sci., Jinan Univ., Guangzhou, China
Abstract :
Snapshot differential algorithm is one of ways of extracting delta from views in the data warehouse in data integration circumstance. Due to the scale of the views in data warehouse is likely to be very massive, it will take lots of time to run snapshot differential algorithm and become the bottleneck of the system performance. In this paper, in order to improve efficiency of Snapshot Differential Algorithm, by using the massive data processing platform, we modify traditional Partition Hash algorithm, improve the efficiency and reduce the calculating time. At the end of this paper, we show a test which will demonstrate the improvement of efficiency after modification.
Keywords :
data warehouses; data integration; data warehouse; hadoop platform; massive data processing platform; partition hash algorithm; snapshot differential algorithm; delta extraction; distributed computing; snapshot differential algorithm;
Conference_Titel :
Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-9792-8
DOI :
10.1109/CSQRWC.2011.6037179