Title :
Par-BF: A Parallel Partitioned Bloom Filter for Dynamic Data Sets
Author :
Yi Liu ; Xiongzi Ge ; Du, David H. C. ; Xiaoxia Huang
Author_Institution :
Shenzhen Inst. of Adv. Technol., Shenzhen, China
Abstract :
Compared with a hash table, a Bloom Filter (BF) is more space-efficient for supporting fast matching though resulting in a controllable and acceptable false positive probability. The space size of the basic BF is predetermined based on the expected number of elements to be stored. However, we cannot predict the scale of a BF space for dynamic sets. The two existing solutions for supporting dynamic sets, Scalable BF (SBF) and Dynamic BF (DBF), still face some challenges on system performance and memory overhead.This paper presents a new BF for dynamic data sets, called Partitioned BF (Par-BF). Compared with DBF and SBF, the size and the range of the false positive probability can be calculated by a group of formulas to leverage a sweet spot between high-performance and low-overhead. Moreover, Par-BF supports parallel fast matching which can improve the overall throughput. From our trace-driven experimental results, the IOPS of Par-BF outperforms that of DBF and SBF from 6X to 10X, and from 2X to 4X, respectively. Meanwhile, through our proposed garbage collection policy, the memory overhead of Par-BF is less than half of the memory usage of SBF.
Keywords :
data structures; pattern matching; probability; storage management; DBF; SBF; dynamic BF; dynamic data sets; dynamic sets; false positive probability; garbage collection policy; memory overhead; par-BF; parallel fast matching; parallel partitioned bloom filter; scalable BF; trace-driven experimental results; Aerospace electronics; Equations; Indexes; Instruction sets; Mathematical model; Memory management; Resource management; Bloom Filter; dynamic sets; Par-BF; Parallel;
Conference_Titel :
Data Intensive Scalable Computing Systems (DISCS), 2014 International Workshop on
DOI :
10.1109/DISCS.2014.5