DocumentCode
131945
Title
A new approach for detecting spam microblogs based on text and user´s social network features
Author
Kaiyu Wang ; Yumei Wang ; Hongqiao Li ; Yilin Xiong ; Xinyu Zhang
Author_Institution
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
fDate
11-14 May 2014
Firstpage
1
Lastpage
5
Abstract
Recently more and more spam messages are emerging on microblogs, which leads to an unpleasant or even deteriorating social network environment. Existing studies on spam microblog detection mostly make use of textual features or social network features alone to detect spam messages. While in this paper, we propose a new detection approach from users´ perspective, which combines social network features of the publishers with textual features of microblogs itself together, to compose a feature vector. By feeding the feature vector into a SVM machine learning system for data training, we classify spam microblogs from benign ones. We conduct experiments with the dataset of Sina Weibo, one of the most famous Chinese microblogs, to verify the effectiveness of our approach. Compared to the approaches which only consider textual or network features, we observe 13% and 29% increases of accuracy respectively with our proposed approach.
Keywords
feature extraction; learning (artificial intelligence); social networking (online); support vector machines; text analysis; unsolicited e-mail; Chinese microblogs; SVM machine learning system; Sina Weibo; data training; social network environment; spam messages; spam microblog classification; spam microblog detection; text features; textual features; user social network features; Accuracy; Feature extraction; Kernel; Social network services; Support vector machine classification; Training; social networks; spam microblogs; support vector machine (SVM); textual feature;
fLanguage
English
Publisher
ieee
Conference_Titel
Wireless Communications, Vehicular Technology, Information Theory and Aerospace & Electronic Systems (VITAE), 2014 4th International Conference on
Conference_Location
Aalborg
Print_ISBN
978-1-4799-4626-6
Type
conf
DOI
10.1109/VITAE.2014.6934446
Filename
6934446
Link To Document