DocumentCode
1786487
Title
Dynamic detection of spammers in Weibo
Author
Zhang Cheng ; Niu Kai ; He Zhiqiang
Author_Institution
Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
fDate
19-21 Sept. 2014
Firstpage
112
Lastpage
116
Abstract
Social networks have developed to maturity up to now. And Weibo submitted its initial public offerings (IPO) to U.S. securities and exchange commission (SEC) on March 15th 2014. However, spamming has been a long existing problem on the Internet, it has existed at the time of Web 1.0. With fast development of social networks, spamming problem emerged and became more complex in social network service. Spammers post feeds containing typical phrases of a trending topic and URLs that usually are uncorrelated with feeds content. These URLs will lead users to certain websites that usually are Taobao shop sites, so spammers can earn money. It is an urgent task to construct mechanisms to automaticly detect and stop spammers. Researches about this have been done. But because of the game between spammers and antispam systems, behavior parttens of spammers changes constantly. In this paper we present an adaptive framwork to detecting spammers dynamiclly by using incremental learning of machine learning algorithm. To access to the identification of spammers we analyze Weibo user behaviors systematically, and find different behavior parttens between spammers and legitimate users. To collect a large set of Weibo user samples, we apply for a high privilege developer account and devise an effective method using Weibo open platform. We collected a large dataset of Sina Weibo which includes 30 million users and 46 million feeds and 980 million links. By checking users´ tweeting behaviors, we gathered training user samples including spammers and legitimate users manually. And then we compared characteristics of user social behaviors of spammers with legitimate users. These characteristics were used in our framwork to devide spammers from legitimate users. Through tests with real data it is proved that this approach can effectively identify the spammers in Weibo.
Keywords
learning (artificial intelligence); social networking (online); support vector machines; unsolicited e-mail; IPO; SEC; Taobao shop sites; U.S. securities-and-exchange commission; URL; Weibo open platform; Weibo user behavior pattern analysis; adaptive framwork; antispam systems; dynamic spammer detection; feed content; incremental learning; initial public offerings; legitimate users; machine learning algorithm; social network service; spammer behavior patterns; spammer identification; spammer identification access; spammer postfeeds; spamming problem; user social behaviors; user tweeting behaviors; Feature extraction; Feeds; Machine learning algorithms; Social network services; Support vector machines; Training; Unsolicited electronic mail; SVM; dynamic detection; machine learning; weibo spammers;
fLanguage
English
Publisher
ieee
Conference_Titel
Network Infrastructure and Digital Content (IC-NIDC), 2014 4th IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4799-4736-2
Type
conf
DOI
10.1109/ICNIDC.2014.7000276
Filename
7000276
Link To Document