DocumentCode
3847048
Title
Sampling Strategies for Epidemic-Style Information Dissemination
Author
Milan Vojnovic;Varun Gupta;Thomas Karagiannis;Christos Gkantsidis
Author_Institution
Microsoft Research Cambridge, United Kingdom
Volume
18
Issue
4
fYear
2010
Firstpage
1013
Lastpage
1025
Abstract
We consider epidemic-style information dissemination strategies that leverage the nonuniformity of host distribution over subnets (e.g., IP subnets) to optimize the information spread. Such epidemic-style strategies are based on random sampling of target hosts according to a sampling rule. In this paper, we consider the metric of total number of samplings (equivalently probes) to reach a given target fraction of the host population. We first identify the minimum number of samplings needed to reach a target fraction of hosts, assuming global information about the host distribution over subnets is available. We show that this optimum can be achieved either by a dynamic strategy, for which the sampling probabilities over subnets are allowed to vary over time, or, surprisingly, even by a static strategy, for which the sampling probabilities over subnets are fixed. These results provide insights about the best achievable performance and how different system parameters affect the number of sampling needed. We then consider simple online sampling strategies that do not require any prior knowledge of the distribution of hosts over subnets, but where each host biases sampling based on its observed sampling outcomes while keeping only O(1) state at any point in time. Using real data-sets from several large-scale Internet measurements, we evaluate significance of the system parameters that determine the sampling requirements and compare the performance of our proposed distribution-oblivious sampling strategies to the theoretical bound. Our results provide insights for the design of efficient information dissemination systems, as well as for the design of countermeasures against worms that use subnet-preferential scanning.
Keywords
"Sampling methods","Internet","Probes","Large-scale systems","Software performance","Databases","Broadcasting"
Journal_Title
IEEE/ACM Transactions on Networking
Publisher
ieee
ISSN
1063-6692
Type
jour
DOI
10.1109/TNET.2010.2051233
Filename
5487433
Link To Document