• DocumentCode
    2268021
  • Title
    Sampling Techniques for Large, Dynamic Graphs
  • Author
    Stutzbach, Daniel ; Rejaie, Reza ; Duffield, Nick ; Sen, Subhabrata ; Willinger, Walter
  • Author_Institution
    Univ. of Oregon, Eugene, OR
  • fYear
    2006
  • fDate
    23-29 April 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Peer-to-peer systems are becoming increasingly popular, with millions of simultaneous users and a wide range of applications. Understanding existing systems and devising new peer-to-peer techniques rely on access to representative models derived from empirical observations. Due to the large and dynamic nature of these systems, directly capturing global behavior is often impractical. Sampling is a natural approach for learning about these systems, and most previous studies rely on it to collect data. This paper addresses the common problem of selecting representative samples of peer properties such as peer degree, link bandwidth, or the number of files shared. A good sampling technique will select any of the peers present with equal probability. However, common sampling techniques introduce bias in two ways. First, the dynamic nature of peers can bias results towards short-lived peers, much as naively sampling flows in a router can lead to bias towards short-lived flows. Second, the heterogeneous overlay topology can lead to bias towards high-degree peers. We present preliminary evidence suggesting that applying a degree-correction method to random-walk-based peer selection leads to unbiased sampling, at the expense of a loss of efficiency.
  • Keywords
    peer-to-peer computing; probability; telecommunication network routing; telecommunication network topology; degree-correction method; heterogeneous overlay topology; peer-to-peer systems; sampling techniques; short-lived peers; walk-based peer selection; Bandwidth; Internet telephony; Peer to peer computing; Performance gain; Robustness; Sampling methods; Topology
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    INFOCOM 2006. 25th IEEE International Conference on Computer Communications. Proceedings
  • Conference_Location
    Barcelona
  • ISSN
    0743-166X
  • Print_ISBN
    1-4244-0221-2
  • Type
    conf
  • DOI
    10.1109/INFOCOM.2006.39
  • Filename
    4146692
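
  • Illustration
    The abstract states that a plain random walk is biased towards high-degree peers and that a degree correction can remove this bias at some cost in efficiency. The abstract does not say which correction is used, so the sketch below assumes one standard choice, a Metropolis-Hastings acceptance rule: a proposed move from peer x to neighbor y is accepted with probability min(1, deg(x)/deg(y)). The toy overlay `TOY_OVERLAY` and the function names are hypothetical and are not taken from the record. The sketch also treats the graph as static, so it does not address the bias from peer dynamics.

```python
import random
from collections import Counter

# Hypothetical toy overlay (assumption, not from the paper): peer 0 is a hub
# linked to every other peer, and peers 1..5 also form a short chain.
TOY_OVERLAY = {
    0: [1, 2, 3, 4, 5],
    1: [0, 2],
    2: [0, 1, 3],
    3: [0, 2, 4],
    4: [0, 3, 5],
    5: [0, 4],
}


def plain_walk(graph, start, steps, rng):
    """Uncorrected random walk: always move to a uniformly chosen neighbor."""
    current = start
    for _ in range(steps):
        current = rng.choice(graph[current])
    return current


def degree_corrected_walk(graph, start, steps, rng):
    """Random walk with a Metropolis-Hastings degree correction (assumed rule).

    A proposed move from x to y is accepted with probability
    min(1, deg(x) / deg(y)); otherwise the walk stays at x for this step.
    Rejected steps are the efficiency cost of removing the degree bias.
    """
    current = start
    for _ in range(steps):
        candidate = rng.choice(graph[current])
        ratio = len(graph[current]) / len(graph[candidate])
        if rng.random() < ratio:  # ratio >= 1 is always accepted
            current = candidate
    return current


def sample_frequencies(walk, graph, samples, steps, seed):
    """Run `samples` independent walks and return the fraction ending at each peer."""
    rng = random.Random(seed)
    counts = Counter(walk(graph, 0, steps, rng) for _ in range(samples))
    return {peer: counts[peer] / samples for peer in graph}


if __name__ == "__main__":
    for walk in (plain_walk, degree_corrected_walk):
        freqs = sample_frequencies(walk, TOY_OVERLAY, 6000, 50, seed=1)
        print(walk.__name__, {p: round(f, 3) for p, f in freqs.items()})
```

    On this toy graph the uncorrected walk should select the hub (degree 5 of 18 total) noticeably more often than the degree-2 peers, while the corrected walk should select all six peers at roughly equal rates.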