DocumentCode :
1824135
Title :
On the utility of abstraction in labeling actors in social networks
Author :
Ngot Bui ; Honavar, V.
Author_Institution :
Comput. Sci. Dept., Iowa State Univ., Ames, IA, USA
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
692
Lastpage :
698
Abstract :
Social networks are naturally represented as heterogeneous networks with multiple types of objects e.g., actors, items and multiple types of links e.g., links between actors that denote social ties e.g., friendship, and links that connect actors to items e.g., photos, videos, articles, etc. that denote relationships between actors and items. In this paper, we consider the task of assigning labels to the unlabeled actors (individuals) in a large heterogeneous social network in which labels are available for a subset of actors. Specifically, we seek to learn a predictive model to label actors based on the attributes of the actors themselves and/or items that are linked to them in the network. Unfortunately, the number of distinct items, represented in real-world networks such as Facebook or Flickr is quite large (in the millions) although only a small subset of them are linked to specific actors. This leads to data sparsity which causes over-fitting and hence poor performance in predicting the labels of unlabeled actors. To address this problem, we induce hierarchical taxonomies over items and use the resulting taxonomies as a basis for selecting abstract and hence parsimonious representations of network data for learning the predictive models. Our experiments using three different predictors (Iterative classification Naïve Bayes, Iterative classification Logistic Regression, and EdgeCluster) on two real-world data sets, Last.fm and Flickr, show that the predictive models that take advantage of abstract representations of network data are competitive with, and in some cases, outperform those that do not.
Keywords :
Bayes methods; iterative methods; pattern classification; regression analysis; social networking (online); Facebook; Flickr; Last.fm; abstract representations; abstract selection; abstraction utility; edgecluster predictor; heterogeneous networks; heterogeneous social network; hierarchical taxonomies; iterative classification Naïve Bayes predictor; iterative classification logistic regression predictor; label assignment; labeling actors; parsimonious representations; predictive model learning; social ties; Abstracts; Conferences; Feature extraction; Labeling; Predictive models; Social network services; Videos;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
Conference_Location :
Niagara Falls, ON
Type :
conf
Filename :
6785778
Link To Document :
بازگشت