Title :
Near-optimal random walk sampling in distributed networks
Author :
Sarma, Atish Das ; Molla, Anisur Rahaman ; Pandurangan, Gopal
Author_Institution :
Google Res., Google Inc., Mountain View, CA, USA
Abstract :
Performing random walks in networks is a fundamental primitive that has found numerous applications in communication networks such as token management, load balancing, network topology discovery and construction, search, and peer-to-peer membership management. While several such algorithms are ubiquitous, and use numerous random walk samples, the walks themselves have always been performed naively. In this paper, we focus on the problem of performing random walk sampling efficiently in a distributed network. Given bandwidth constraints, the goal is to minimize the number of rounds and messages required to obtain several random walk samples in a continuous online fashion. We present the first round and message optimal distributed algorithms that present a significant improvement on all previous approaches. The theoretical analysis and comprehensive experimental evaluation of our algorithms show that they perform very well in different types of networks of differing topologies. In particular, our results show how several random walks can be performed continuously (when source nodes are provided only at runtime, i.e., online), such that each walk of length ℓ can be performed exactly in just Õ(√ℓD) rounds (where D is the diameter of the network), and O(ℓ) messages. This significantly improves upon both, the naive technique that requires O(ℓ) rounds and O(ℓ) messages, and the sophisticated algorithm of [13] that has the same round complexity as this paper but requires Ω(m√ℓ) messages (where m is the number of edges in the network). Our theoretical results are corroborated through extensive experiments on various topological data sets. Our algorithms are fully decentralized, lightweight, and easily implementable, and can serve as building blocks in the design of topologically-aware networks.
Keywords :
computational complexity; distributed algorithms; graph theory; telecommunication network topology; bandwidth constraints; building blocks; communication networks; comprehensive experimental evaluation; distributed networks; load balancing; near-optimal random walk sampling; network topologies; network topology discovery; node graph; peer-to-peer membership management; round complexity; round-message optimal distributed algorithms; sophisticated algorithm; source nodes; theoretical analysis; token management; topological data sets; topologically-aware networks; Algorithm design and analysis; Complexity theory; Distributed algorithms; Electronic mail; Graph theory; Network topology; Peer to peer computing;
Conference_Titel :
INFOCOM, 2012 Proceedings IEEE
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4673-0773-4
DOI :
10.1109/INFCOM.2012.6195727