Title :
Inferring cellular user demographic information using homophily on call graphs
Author :
Vi Wang ; Hui Zang ; Faloutsos, Michalis
Author_Institution :
Univ. of California Riverside, Riverside, CA, USA
Abstract :
Homophily refers to the phenomenon where people who are socially-connected share many characteristics including demographic and behavioral properties. The goal of this paper is to see whether homophily exists in call networks and if so, to what degree we can infer a cellphone user´s demographic properties by knowing the demographic information of the people that s/he talks to. We focus on three types of demographic information: a) home location, b) age group, and c) income level. The novelty is two-folds. First, we use both communication metrics and structural properties of call graphs to identify those “important” friends for each user with whom (s)he is most likely to be in homophily. Second, we assess the importance of different time slices such as weekdays, or nights and weekends for capturing different user relationships. We conduct our study on a real data trace with 20M subscribers during one month from a nationwide cellular carrier. Our first contribution is that we quantify the extent of homophily on the call graph and identify the correlations between homophily and communication and structural features. As a second contribution, we develop effective methods to infer demographic information for a cellular user using linear regression to select the most homophily-like friend of her/him. We find that we can predict home location within 20km radius with 80% accuracy, and age group and income level with 78% and 72% accuracy, respectively.
Keywords :
cellular radio; demography; graph theory; regression analysis; age group; behavioral properties; call graph; call network; cellphone user demographic properties; cellular user demographic information; communication metrics; home location; homophily-like friend; income level; linear regression; nationwide cellular carrier; socially-connected people; structural feature; structural property; user relationship; Accuracy; Communication networks; Conferences; Correlation; Linear regression; Prediction algorithms; Social network services;
Conference_Titel :
INFOCOM, 2013 Proceedings IEEE
Conference_Location :
Turin
Print_ISBN :
978-1-4673-5944-3
DOI :
10.1109/INFCOM.2013.6567165