• DocumentCode
    3599714
  • Title

    Topic Block: Mining User Inner Interests for Text and Link Analysis in Social Networks

  • Author

    Wenyu Zang ; Chuan Zhou ; Xiao Wang ; Li Guo

  • Author_Institution
    Inst. of Comput. Technol., Beijing, China
  • fYear
    2014
  • Firstpage
    159
  • Lastpage
    165
  • Abstract
    Text corpus and link network are interrelated data in social networks. Discovering the inner relationship between these two kinds of data can help better understand the evolution mechanism underneath social networks. Moreover, social networks exhibit unique characteristics such as sparse and noisy in both text and link data. Thus, it is imperative to combine both text and link data to complement and correct mining results. However, previous work did not explore a uniform generative model that can unveil their inner relationship probably because of the difficulty to harness the heterogenous data in social networks. To address this issue, in this paper we present a generative model Topic Block that clearly pinpoints the latent concept underlying the text corpus and link network, i.e., User inner interests. In our generative model, user inner interests guide the generation of the topic and community distributions underlying the text corpus and link data. We can infer the topic and community distributions based on the user inner interests through both content and topology information. Compared to existing popular models, our method experimentally outperforms on three real world social network data sets.
  • Keywords
    data mining; social networking (online); text analysis; Topic Block; community distributions; link analysis; link data; social networks; text analysis; text corpus; user inner interests mining; Analytical models; Collaboration; Communities; Data mining; Data models; Electronic mail; Social network services; social networks; topic model; user inner interests;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Computing, Applications and Technologies (PDCAT), 2014 15th International Conference on
  • Type

    conf

  • DOI
    10.1109/PDCAT.2014.33
  • Filename
    7174781