• DocumentCode
    592186
  • Title

    On the estimation accuracy of degree distributions from graph sampling

  • Author

    Ribeiro, Bernardete ; Towsley, Don

  • Author_Institution
    Comput. Sci. Dept., Univ. of Massachusetts, Amherst, MA, USA
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    5240
  • Lastpage
    5247
  • Abstract
    Estimating characteristics of large graphs via sampling is vital in the study of complex networks. In this work, we study the Mean Squared Error (MSE) associated with different sampling methods for the degree distribution. These sampling methods include independent random vertex (RV) and random edge (RE) sampling, and crawling methods such as random walks (RWs) and the widely used Metropolis-Hastings algorithm for uniformly sampling vertices (MHRWu). We see that the RW MSE is upper bounded by a quantity that is proportional to the RE MSE and inversely proportional to the spectral gap of the RW transition probability matrix. We also determine conditions under which RW is preferable to RV. Finally, we present an approximation of the MHRWu MSE. We evaluate the accuracy of our approximations and bounds through simulations on large real world graphs.
  • Keywords
    complex networks; graph theory; matrix algebra; mean square error methods; probability; sampling methods; MHRWu; MSE; Metropolis-Hastings algorithm; RE; RV; RW transition probability matrix; complex networks; crawling methods; degree distributions; estimation accuracy; graph sampling; independent random vertex sampling; large graph characteristics estimation; large real world graphs; mean squared error; random edge sampling; random walks; Approximation methods; Eigenvalues and eigenfunctions; Estimation error; Random variables; Reactive power; Standards; Upper bound;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
  • Conference_Location
    Maui, HI
  • ISSN
    0743-1546
  • Print_ISBN
    978-1-4673-2065-8
  • Electronic_ISBN
    0743-1546
  • Type

    conf

  • DOI
    10.1109/CDC.2012.6425857
  • Filename
    6425857