• DocumentCode
    3585102
  • Title

    Par-BF: A Parallel Partitioned Bloom Filter for Dynamic Data Sets

  • Author

    Yi Liu ; Xiongzi Ge ; Du, David H. C. ; Xiaoxia Huang

  • Author_Institution
    Shenzhen Inst. of Adv. Technol., Shenzhen, China
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Compared with a hash table, a Bloom Filter (BF) is more space-efficient for supporting fast matching though resulting in a controllable and acceptable false positive probability. The space size of the basic BF is predetermined based on the expected number of elements to be stored. However, we cannot predict the scale of a BF space for dynamic sets. The two existing solutions for supporting dynamic sets, Scalable BF (SBF) and Dynamic BF (DBF), still face some challenges on system performance and memory overhead.This paper presents a new BF for dynamic data sets, called Partitioned BF (Par-BF). Compared with DBF and SBF, the size and the range of the false positive probability can be calculated by a group of formulas to leverage a sweet spot between high-performance and low-overhead. Moreover, Par-BF supports parallel fast matching which can improve the overall throughput. From our trace-driven experimental results, the IOPS of Par-BF outperforms that of DBF and SBF from 6X to 10X, and from 2X to 4X, respectively. Meanwhile, through our proposed garbage collection policy, the memory overhead of Par-BF is less than half of the memory usage of SBF.
  • Keywords
    data structures; pattern matching; probability; storage management; DBF; SBF; dynamic BF; dynamic data sets; dynamic sets; false positive probability; garbage collection policy; memory overhead; par-BF; parallel fast matching; parallel partitioned bloom filter; scalable BF; trace-driven experimental results; Aerospace electronics; Equations; Indexes; Instruction sets; Mathematical model; Memory management; Resource management; Bloom Filter; dynamic sets; Par-BF; Parallel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Intensive Scalable Computing Systems (DISCS), 2014 International Workshop on
  • Type

    conf

  • DOI
    10.1109/DISCS.2014.5
  • Filename
    7079020