DocumentCode
837187
Title
Space-Code Bloom Filter for Efficient Per-Flow Traffic Measurement
Author
Kumar, Abhishek ; Jun Xu ; Wang, Jia
Author_Institution
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA
Volume
24
Issue
12
fYear
2006
Firstpage
2327
Lastpage
2339
Abstract
Per-flow traffic measurement is critical for usage accounting, traffic engineering, and anomaly detection. Previous methodologies are either based on random sampling (e.g., Cisco\´s NetFlow), which is inaccurate, or only account for the "elephants." We introduce a novel technique for measuring per-flow traffic approximately, for all flows regardless of their sizes, at very high-speed (say, OC768). The core of this technique is a novel data structure called Space-Code Bloom Filter (SCBF). A SCBF is an approximate representation of a multiset; each element in this multiset is a traffic flow and its multiplicity is the number of packets in the flow. The multiplicity of an element in the multiset represented by SCBF can be estimated through either of two mechanisms-maximum-likelihood estimation or mean value estimation. Through parameter tuning, SCBF allows for graceful tradeoff between measurement accuracy and computational and storage complexity. SCBF also contributes to the foundation of data streaming by introducing a new paradigm called blind streaming. We evaluate the performance of SCBF through mathematical analysis and through experiments on packet traces gathered from a tier-1 ISP backbone. Our results demonstrate that SCBF achieves reasonable measurement accuracy with very low storage and computational complexity. We also demonstrate the application of SCBF in estimating the frequency of keywords at a search engine-demonstrating the applicability of SCBF to other problems that can be reduced to multiset membership queries
Keywords
Internet; data structures; information filters; maximum likelihood estimation; sampling methods; telecommunication traffic; Internet service provider; SCBF; Space-Code Bloom Filter; blind streaming; computational complexity; data streaming; data structure; mathematical analysis; maximum-likelihood estimation; mean value estimation; per-flow traffic measurement; search engine; tier-1 ISP backbone; Bloom filter (BF); data structures; network measurement; statistical inference; traffic analysis;
fLanguage
English
Journal_Title
Selected Areas in Communications, IEEE Journal on
Publisher
ieee
ISSN
0733-8716
Type
jour
DOI
10.1109/JSAC.2006.884032
Filename
4016143
Link To Document