DocumentCode
3304669
Title
An Information Divergence Estimation over Data Streams
Author
Anceaume, Emmanuelle ; Busnel, Yann
Author_Institution
IRISA, Rennes, France
fYear
2012
fDate
23-25 Aug. 2012
Firstpage
28
Lastpage
35
Abstract
In this paper, we consider the setting of large scale distributed systems, in which each node needs to quickly process a huge amount of data received in the form of a stream that may have been tampered with by an adversary. In this situation, a fundamental problem is how to detect and quantify the amount of work performed by the adversary. To address this issue, we have proposed in a prior work, AnKLe, a one pass algorithm for estimating the Kullback-Leibler divergence of an observed stream compared to the expected one. Experimental evaluations have shown that the estimation provided by AnKLe is accurate for different adversarial settings for which the quality of other methods dramatically decreases. In the present paper, considering n as the number of distinct data items in a stream, we show that AnKLe is an (ε, δ)-approximation algorithm with a space complexity Õ(1/ε + 1/ε2) bits in “most” cases, and Õ(1/ε + n-ε-1/ε2 ) otherwise. To the best of our knowledge, an approximation algorithm for estimating the Kullback-Leibler divergence has never been analyzed before.
Keywords
approximation theory; computational complexity; data handling; distributed processing; estimation theory; (ε, δ)-approximation algorithm; AnKLe; Kullback-Leibler divergence estimation; data processing; data stream; information divergence estimation; large scale distributed system; space complexity; Algorithm design and analysis; Approximation algorithms; Approximation methods; Entropy; Estimation; Frequency estimation; Radiation detectors; Data stream; divergence; randomized approximation algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Network Computing and Applications (NCA), 2012 11th IEEE International Symposium on
Conference_Location
Cambridge, MA
Print_ISBN
978-1-4673-2214-0
Type
conf
DOI
10.1109/NCA.2012.16
Filename
6299123
Link To Document