• DocumentCode
    140926
  • Title

    Region sampling and estimation of geosocial data with dynamic range calibration

  • Author

    Yanhua Li ; Steiner, Matthias ; Jie Bao ; Limin Wang ; Ting Zhu

  • Author_Institution
    HUAWEI Noah´s Ark Lab., China
  • fYear
    2014
  • fDate
    March 31 2014-April 4 2014
  • Firstpage
    1096
  • Lastpage
    1107
  • Abstract
    Location based social networks (LBSNs) are becoming increasingly popular with the fast deployment of broadband mobile networks and the growing prevalence of versatile mobile devices. This success has attracted great interest in studying and measuring the characteristics of LBSNs, such as Facebook Places, Yelp, and Google+ Local. However, it is often prohibitive, and sometimes too costly, to obtain a detailed and complete snapshot of a LBSN due to its usually massive scale. In this work, taking Foursquare as an example, we focus on sampling and estimating restricted geographic regions in LBSNs, such as a city or a country. By exploiting the application programming interfaces (APIs) provided by Foursquare for geographic search, we first introduce how to obtain the “ground truth”, namely, a complete set of all venues (i.e., places) in a specified region. Then, we propose random region sampling algorithms that allow us to draw representative samples of venues, and design unbiased estimators of regional characteristics of venues. We validate the efficiency of our sampling algorithms on Foursquare using complete datasets obtained from 12 regions, such as Switzerland, New York City and Los Angeles. Our results are applicable to perform sampling and estimation in all GeoDatabases, such as Facebook Places, Yelp, and Google+ Local, which have similar venue search APIs as Foursquare. These location service providers can also benefit from our results to enable efficient online statistic estimation.
  • Keywords
    application program interfaces; data handling; geographic information systems; mobile computing; social networking (online); API; Facebook; Google; LBSN; Yelp; application programming interfaces; broadband mobile networks; dynamic range calibration; geographic regions; geosocial data; location based social networks; mobile devices; random region sampling algorithms; region estimation; region sampling; regional characteristics; Algorithm design and analysis; Cities and towns; Clustering algorithms; Educational institutions; Estimation; Facebook;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering (ICDE), 2014 IEEE 30th International Conference on
  • Conference_Location
    Chicago, IL
  • Type

    conf

  • DOI
    10.1109/ICDE.2014.6816726
  • Filename
    6816726