DocumentCode :
678444
Title :
Uncovering the Location of Twitter Users
Author :
Rodrigues, Ernesto ; Assuncao, R. ; Pappa, Gisele L. ; Miranda, Robbin ; Meira, Wagner
Author_Institution :
Dept. de Estatistica, Univ. Fed. de Ouro Preto, Ouro Preto, Brazil
fYear :
2013
fDate :
19-24 Oct. 2013
Firstpage :
237
Lastpage :
241
Abstract :
Social networks, like Twitter and Facebook, are valuable sources to monitor real-time events, such as earthquakes and epidemics. For this type of surveillance the user´s location is an essential piece of information, but a substantial number of users choose not to disclose their geographical information. However, characteristics of the users´ behavior, such as the friends they associate with and the types of messages published may hint on their spatial location. In this paper, we present a method to infer the spatial location of Twitter users. Unlike the approaches proposed so far, we incorporate two sources of information to learn the geographical position: the text posted by users and their friendship network. We propose a probabilistic approach that jointly models the geographical labels and the Twitter texts of the users organized in the form of a graph representing the friendship network. We use the Markov random field probability model to represent the network and learning is carried out through a Markov chain Monte Carlo simulation technique to approximate the posterior probability distribution of the missing geographical labels. We demonstrate the utility of this model in a large dataset of Twitter users, where the ground truth is the location given by the GPS position, GeoIP location or declared location. The method is evaluated and compared to two baseline algorithms that employ either of these two types of information. The accuracy rates achieved are significantly better than those of the baseline methods.
Keywords :
Markov processes; Monte Carlo methods; inference mechanisms; learning (artificial intelligence); network theory (graphs); social networking (online); statistical distributions; Facebook; GPS position; GeoIP location; Global Positioning System; Markov chain Monte Carlo simulation technique; Markov random field; Twitter users location; declared location; friendship network; geographical information; geographical labels; graph; information sources; learning; network representation; posterior probability distribution; probabilistic approach; social networks; spatial location; text post; user behavior; Accuracy; Approximation algorithms; Cities and towns; Markov processes; Probabilistic logic; Probability distribution; Twitter; Geographic Targeting; Geolocation Estimation; Network Learning; Spatial Data Mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems (BRACIS), 2013 Brazilian Conference on
Conference_Location :
Fortaleza
Type :
conf
DOI :
10.1109/BRACIS.2013.47
Filename :
6726455
Link To Document :
بازگشت