DocumentCode :
2239840
Title :
Performance evaluation on data reconciliation algorithm in distributed system
Author :
Xin Wang ; Hongming Zhu ; Qin Liu ; Xiaowen Yang ; Jiakai Xiao
Author_Institution :
Sch. of Software & Eng., Tongji Univ., Shanghai, China
fYear :
2012
fDate :
Oct. 30 2012-Nov. 1 2012
Firstpage :
381
Lastpage :
385
Abstract :
This research came from a school-enterprise cooperation program, which aims to improve data reconciliation efficiency between two large-scale data sources. This paper mainly presents three typical algorithms: standard Bloom filter (BF), counting Bloom filter (CBF) and Invertible Bloom filter (IBF). With the purpose of evaluating their performance, mainly on runtime and accuracy rate, a series of experiments were designed and applied to both a small-scale and a large-scale distributed system. These algorithms are compared based on one traditional query method Inner Join (IJ). And the result shows: under the MapReduce computing framework, Inner Join, followed by BF closely, has the best performance; large-scale distributed system can evidently improve the performance on dealing with large-scale data.
Keywords :
data structures; parallel algorithms; query processing; software performance evaluation; BF; CBF; IBF; IJ; MapReduce computing framework; counting Bloom filter; data reconciliation algorithm; inner join query method; invertible Bloom filter; large-scale data; large-scale data sources; large-scale distributed system; performance evaluation; school-enterprise cooperation program; small-scale distributed system; standard Bloom filter; Accuracy; Algorithm design and analysis; Arrays; Clustering algorithms; Distributed databases; Filtering algorithms; Runtime; Bloom filter; Data reconciliation; Hadoop; Large-scale;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing and Intelligent Systems (CCIS), 2012 IEEE 2nd International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4673-1855-6
Type :
conf
DOI :
10.1109/CCIS.2012.6664432
Filename :
6664432
Link To Document :
بازگشت