DocumentCode :
2839818
Title :
Distributed Incomplete Pattern Matching via a Novel Weighted Bloom Filter
Author :
Liu, Siyuan ; Kang, Lei ; Chen, Lei ; Ni, Lionel
Author_Institution :
iLab, Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2012
fDate :
18-21 June 2012
Firstpage :
122
Lastpage :
131
Abstract :
In this paper, we first propose a very interesting and practical problem, pattern matching in a distributed mobile environment. Pattern matching is a well-known problem and extensive research has been conducted for performing effective and efficient search. However, previous proposed approaches assume that data are centrally stored, which is not the case in a mobile environment (e.g., mobile phone networks), where one person´s pattern could be separately stored in a number of different stations, and such a local pattern is incomplete compared with the global pattern. A simple solution to pattern matching over a mobile environment is to collect all the data distributed in base stations to a data center and conduct pattern matching at the data center afterwards. Clearly, such a simple solution will raise huge amount of communication traffic, which could cause the communication bottleneck brought by the limited wireless bandwidth to be even worse. Therefore, a communication efficient and search effective solution is necessary. In our work, we present a novel solution which is based on our well-designed Weighted Bloom Filter (WBF), called, Distributed Incomplete pattern matching (DI-matching), to find target patterns over a distributed mobile environment. Specifically, to save communication cost and ensure pattern matching in distributed incomplete patterns, we use WBF to encode a query pattern and disseminate the encoded data to each base station. Each base station conducts a local pattern search according to the received WBF. Only qualified IDs and corresponding weights in each base station are sent to the data center for aggregation and verification. Through extensive empirical experiments on a real city-scale mobile networks data set, we demonstrate the effectiveness and efficiency of our proposed solutions.
Keywords :
data analysis; data structures; mobile radio; pattern matching; query processing; DI-matching; base stations; city-scale mobile network; communication bottleneck; communication cost; communication traffic; data aggregation; data center; data verification; distributed data collection; distributed incomplete pattern matching; distributed mobile environment; encoded data dissemination; global pattern; limited wireless bandwidth; local pattern search; mobile phone network; person pattern; query pattern encoding; search effective solution; space-efficient randomized data structure; target pattern finding; weighted bloom filter; Base stations; Distributed databases; Matched filters; Mobile communication; Mobile handsets; Pattern matching; Time series analysis; Incomplete pattern matching; Weighted Bloom Filter; distributed mobile environment; time series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Distributed Computing Systems (ICDCS), 2012 IEEE 32nd International Conference on
Conference_Location :
Macau
ISSN :
1063-6927
Print_ISBN :
978-1-4577-0295-2
Type :
conf
DOI :
10.1109/ICDCS.2012.24
Filename :
6257985
Link To Document :
بازگشت