Title :
Bloom filter based optimization on HBase with MapReduce
Author :
Bhushan, Mani ; Banerjea, Shashwati ; Yadav, Santosh Kumar
Author_Institution :
Dept. of Comput. Sci. & Eng., Motilal Nehru Nat. Inst. of Technol., Allahabad, India
Abstract :
In recent time, HBase is growing with its requirement as data exploding in E-world. Hadoop provide distributed manner and taking technology to next level as it is using HBase to store bulk data. HBase works with Hadoop to transfer data to datanodes. As traditionally large file need to split for performing operations but MapReduce provide facility to do operations on single file. This paper providing concept to reduce traffic of data on network with efficient manner through probabilistic model of Bloom filter. Instead of taking whole data, Bloom filter is providing join operation only with array data structure by developing global filter.
Keywords :
data structures; parallel programming; statistical analysis; Bloom filter based optimization; HBase; MapReduce; e-world; join operation; probabilistic model; Arrays; Big data; Databases; Facebook; Real-time systems; Servers; Big Data; Bloom filter; Hadoop; MapReduce;
Conference_Titel :
Data Mining and Intelligent Computing (ICDMIC), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-4675-4
DOI :
10.1109/ICDMIC.2014.6954230