DocumentCode
2720054
Title
Identifying users in social networks with limited information
Author
Vesdapunt, Norases ; Garcia-Molina, Hector
Author_Institution
Comput. Sci. Dept., Stanford Univ., Stanford, CA, USA
fYear
2015
fDate
13-17 April 2015
Firstpage
627
Lastpage
638
Abstract
We study the problem of Entity Resolution (ER) with limited information. ER is the problem of identifying and merging records that represent the same real-world entity. In this paper, we focus on the resolution of a single node g from one social graph (Google+ in our case) against a second social graph (Twitter in our case). We want to find the best match for g in Twitter, by dynamically probing the Twitter graph (using a public API), limited by the number of API calls that social systems allow. We propose two strategies that are designed for limited information and can be adapted to different limits. We evaluate our strategies against a naive one on a real dataset and show that our strategies can provide improved accuracy with significantly fewer API calls.
Keywords
application program interfaces; graph theory; social networking (online); API calls; ER; Google+; Twitter graph; entity resolution; limited information; naive strategies; public API; social graph; social networks; user identification; Accuracy; Erbium; Google; Logistics; Probes; Twitter;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering (ICDE), 2015 IEEE 31st International Conference on
Conference_Location
Seoul
Type
conf
DOI
10.1109/ICDE.2015.7113320
Filename
7113320
Link To Document