DocumentCode :
3495234
Title :
Graph-based features for supervised link prediction
Author :
Cukierski, William ; Hamner, Benjamin ; Yang, Bo
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
1237
Lastpage :
1244
Abstract :
The growing ubiquity of social networks has spurred research in link prediction, which aims to predict new connections based on existing ones in the network. The 2011 IJCNN Social Network challenge asked participants to separate real edges from fake in a set of 8960 edges sampled from an anonymized, directed graph depicting a subset of relationships on Flickr. Our method incorporates 94 distinct graph features, used as input for classification with Random Forests. We present a three-pronged approach to the link prediction task, along with several novel variations on established similarity metrics. We discuss the challenges of processing a graph with more than a million nodes. We found that the best classification results were achieved through the combination of a large number of features that model different aspects of the graph structure. Our method achieved an area under the receiver-operator characteristic (ROC) curve of 0.9695, the 2nd best overall score in the competition and the best score which did not de-anonymize the dataset.
Keywords :
directed graphs; learning (artificial intelligence); pattern classification; social networking (online); IJCNN social network challenge; Random Forests; directed graph; graph-based features; receiver-operator characteristic; social networks; supervised link prediction; Approximation methods; Bayesian methods; Feature extraction; Prediction algorithms; Prediction methods; Social network services; Sparse matrices;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033365
Filename :
6033365
Link To Document :
بازگشت