DocumentCode
3004618
Title
Measuring bias in the mixing time of social graphs due to graph sampling
Author
Mohaisen, Aziz ; Pengkui Luo ; Yanhua Li ; Yongdae Kim ; Zhi-Li Zhang
Author_Institution
Verisign Labs., VA, USA
fYear
2012
fDate
Oct. 29 2012-Nov. 1 2012
Firstpage
1
Lastpage
6
Abstract
Sampling of large social graphs is used for addressing infeasibility of measurements in large social graphs, or for crawling graphs from online social network services where accessing an entire social graph at once is often impossible. Sampling algorithms aim at maintaining certain properties of the original graphs in the sampled (or crawled) ones. Several sampling algorithms, such as breadth-first search, standard random walk, and Metropolis-Hastings random walk, among others, are widely used in the literature for sampling graphs. Some of these sampling algorithms are known for their bias, mainly towards high degree nodes, while bias for other metrics is not well-studied. In this paper we consider the bias of sampling algorithms on the mixing time. We quantitatively show that some existing sampling algorithms, even those which are unbiased to the degree distribution, always produce biased estimation of the mixing time of social graphs. We argue that bias in sampling algorithms accepted in the literature is rather metric-dependent, and a given sampling algorithm, while may work nicely and unbiased to one property, may produce considerable amount of bias in other properties.
Keywords
graph theory; random processes; sampling methods; social networking (online); tree searching; Metropolis-Hastings random walk; biased estimation; breadth-first search; crawling graph; degree distribution; graph sampling; mixing time; online social network service; social graph; standard random walk; Area measurement; Estimation; Facebook; Standards; Time measurement; YouTube; Biased estimation; Mixing Time; Sampling; Social graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
MILITARY COMMUNICATIONS CONFERENCE, 2012 - MILCOM 2012
Conference_Location
Orlando, FL
ISSN
2155-7578
Print_ISBN
978-1-4673-1729-0
Type
conf
DOI
10.1109/MILCOM.2012.6415714
Filename
6415714
Link To Document