DocumentCode
2210129
Title
Mixed-Membership Stochastic Block-Models for Transactional Networks
Author
Shafiei, Mahdi ; Chipman, Hugh
Author_Institution
Dept. of Math. & Stat., Acadia Univ., Wolfville, NS, Canada
fYear
2010
fDate
13-17 Dec. 2010
Firstpage
1019
Lastpage
1024
Abstract
Transactional network data can be thought of as a list of one-to-many communications (e.g., email) between nodes in a social network. Most social network models convert this type of data into binary relations between pairs of nodes. We develop a latent mixed membership model capable of modeling richer forms of transactional network data, including relations between more than two nodes. The model can cluster nodes and predict transactions. The block-model nature of the model implies that groups can be characterized in very general ways. This flexible notion of group structure enables discovery of rich structure in transactional networks. Estimation and inference are accomplished via a variational EM algorithm. Simulations indicate that the learning algorithm can recover the correct generative model. Interesting structure is discovered in the Enron email dataset and another dataset extracted from the Reddit website. Analysis of the Reddit data is facilitated by a novel performance measure for comparing two soft clusterings. The new model is superior at discovering mixed membership in groups and in predicting transactions.
Keywords
data mining; learning (artificial intelligence); social networking (online); stochastic processes; transaction processing; Enron email dataset; Reddit Website; binary relation; generative model; learning algorithm; mixed membership stochastic block model; one-to-many communication; social network; soft clustering; transaction prediction; transactional network data; variational EM algorithm; Clustering; Email Data; Mixedmembership; Social Network Analysis; Variational EM;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-4786
Print_ISBN
978-1-4244-9131-5
Electronic_ISBN
1550-4786
Type
conf
DOI
10.1109/ICDM.2010.88
Filename
5694078
Link To Document