• DocumentCode
    2210129
  • Title

    Mixed-Membership Stochastic Block-Models for Transactional Networks

  • Author

    Shafiei, Mahdi ; Chipman, Hugh

  • Author_Institution
    Dept. of Math. & Stat., Acadia Univ., Wolfville, NS, Canada
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    1019
  • Lastpage
    1024
  • Abstract
    Transactional network data can be thought of as a list of one-to-many communications (e.g., email) between nodes in a social network. Most social network models convert this type of data into binary relations between pairs of nodes. We develop a latent mixed membership model capable of modeling richer forms of transactional network data, including relations between more than two nodes. The model can cluster nodes and predict transactions. The block-model nature of the model implies that groups can be characterized in very general ways. This flexible notion of group structure enables discovery of rich structure in transactional networks. Estimation and inference are accomplished via a variational EM algorithm. Simulations indicate that the learning algorithm can recover the correct generative model. Interesting structure is discovered in the Enron email dataset and another dataset extracted from the Reddit website. Analysis of the Reddit data is facilitated by a novel performance measure for comparing two soft clusterings. The new model is superior at discovering mixed membership in groups and in predicting transactions.
  • Keywords
    data mining; learning (artificial intelligence); social networking (online); stochastic processes; transaction processing; Enron email dataset; Reddit Website; binary relation; generative model; learning algorithm; mixed membership stochastic block model; one-to-many communication; social network; soft clustering; transaction prediction; transactional network data; variational EM algorithm; Clustering; Email Data; Mixedmembership; Social Network Analysis; Variational EM;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2010.88
  • Filename
    5694078