DocumentCode :
87803
Title :
Mining user interest in microblogs with a user-topic model
Author :
He Li ; Jia Yan ; Han Weihong ; Ding Zhaoyun
Author_Institution :
Sch. of Comput. Sci., Nat. Univ. of Defense Technol., Changsha, China
Volume :
11
Issue :
8
fYear :
2014
fDate :
Aug. 2014
Firstpage :
131
Lastpage :
144
Abstract :
Microblogs have become an important platform for people to publish, transform information and acquire knowledge. This paper focuses on the problem of discovering user interest in microblogs. In this paper, we propose a topic mining model based on Latent Dirichlet Allocation (LDA) named user-topic model. For each user, the interests are divided into two parts by different ways to generate the microblogs: original interest and retweet interest. We represent a Gibbs sampling implementation for inference the parameters of our model, and discover not only user´s original interest, but also retweet interest. Then we combine original interest and retweet interest to compute interest words for users. Experiments on a dataset of Sina microblogs demonstrate that our model is able to discover user interest effectively and outperforms existing topic models in this task. And we find that original interest and retweet interest are similar and the topics of interest contain user labels. The interest words discovered by our model reflect user labels, but range is much broader.
Keywords :
data mining; social networking (online); Gibbs sampling; LDA; Sina microblog dataset; latent Dirichlet allocation; original interest; retweet interest; topic mining model; user interest mining; user labels; user-topic model; Computational modeling; Data models; Educational institutions; Mathematical model; Probability distribution; Twitter; LDA; microblogs; topic mining; user interest; user-topic model;
fLanguage :
English
Journal_Title :
Communications, China
Publisher :
ieee
ISSN :
1673-5447
Type :
jour
DOI :
10.1109/CC.2014.6911095
Filename :
6911095
Link To Document :
بازگشت