DocumentCode :
116444
Title :
Learning the information diffusion probabilities by using variance regularized EM algorithm
Author :
Haiguang Li ; Tianyu Cao ; Zhao Li
Author_Institution :
Univ. of Vermont, Burlington, VT, USA
fYear :
2014
fDate :
17-20 Aug. 2014
Firstpage :
273
Lastpage :
280
Abstract :
In this paper we address the problem of learning information diffusion probabilities when data on information diffusion is insufficient. By observing information diffusion behavior on the popular social network website Twitter, we find that evidence of information diffusion is extremely sparse. Fewer than one percent of tweets are retweeted, even though retweeting is considered the most important form of information diffusion evidence on Twitter. Previous approaches to predicting information diffusion probabilities fail in such scenarios because of overfitting. To overcome this problem, we first propose using the variance of the diffusion probabilities as a measure of model complexity for the independent cascade model. We then propose two regularization schemes to reduce model complexity. The first scheme regularizes the variance of the diffusion probabilities directly. The second scheme regularizes the mean absolute deviation of the logarithm of the diffusion probabilities. We derive an approximate solution for the first scheme and an analytical solution for the second. We conduct experiments by simulating information diffusion on six social network datasets. Experimental results show that the variance regularization scheme outperforms the baseline by a noticeable margin. The mean absolute deviation regularization scheme also performs better than the baseline.
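As a rough illustration of the two complexity measures named in the abstract, the following Python sketch computes the variance of a set of edge diffusion probabilities and the mean absolute deviation of their logarithms, then subtracts a weighted penalty from a log-likelihood. This is not the authors' algorithm: the function names, the weight `lam`, the choice of population variance, and taking the deviation about the mean (rather than the median) are all assumptions made for illustration, and the EM update itself is not shown.

```python
import math


def variance_penalty(probs):
    """Population variance of the edge diffusion probabilities (assumed form)."""
    mean = sum(probs) / len(probs)
    return sum((p - mean) ** 2 for p in probs) / len(probs)


def log_mad_penalty(probs):
    """Mean absolute deviation, about the mean, of log-probabilities (assumed form).

    Every probability must be strictly positive so that its logarithm exists.
    """
    logs = [math.log(p) for p in probs]
    mean = sum(logs) / len(logs)
    return sum(abs(x - mean) for x in logs) / len(logs)


def regularized_objective(log_likelihood, probs, lam, penalty=variance_penalty):
    """Penalized objective: log-likelihood minus lam times the chosen penalty.

    `lam` is a hypothetical regularization weight; larger values push the
    learned probabilities toward a common value.
    """
    return log_likelihood - lam * penalty(probs)
```

Under these assumptions, a model whose edges all share one probability incurs zero penalty under either measure, which matches the abstract's idea that spread in the diffusion probabilities is what counts as model complexity.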
Keywords :
approximation theory; computational complexity; probability; social networking (online); Twitter; approximation solution; independent cascade model; information diffusion probabilities; model complexity; popular social network Website; variance regularization scheme; variance regularized EM algorithm; Complexity theory; Equations; Mathematical model; Maximum likelihood estimation; Learning; Regularization
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Advances in Social Networks Analysis and Mining (ASONAM), 2014 IEEE/ACM International Conference on
Conference_Location :
Beijing
Type :
conf
DOI :
10.1109/ASONAM.2014.6921596
Filename :
6921596