Title :
A study of effective features for detecting long-surviving Twitter spam accounts
Author :
Po-Ching Lin ; Po-Min Huang
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Chung Cheng Univ., Chaiyi, Taiwan
Abstract :
As social networking websites get popular in recent years, the spam accounts on them also increase rapidly. A number of features to detect spam accounts have been proposed in prior studies. In this work, we evaluate the common features to see how effective they are to detect Twitter spam accounts or not. We used the Twitter APIs to collect 26,758 public accounts with 508,403 tweets intermittently over the period from September 2011 to March 2012. Although Twitter has its own method to detect and suspend spam accounts, many long-surviving spam accounts are still on it. We selected 816 out of the long-surviving accounts to persistently observe their activities and recorded their lifespans. We find that some features are not so effective for the detection, and thus select two features, the URL rate and the interaction rate, to be the detection features in this work. With classification by the J4.8 algorithm, the precision of the detection is estimated to be between 0.829 and 0.885, and the recall is between 0.987 and 0.999. We also present several additional techniques to further improve the detection accuracy.
Keywords :
application program interfaces; social networking (online); J4.8 algorithm; Twitter API; URL rate; interaction rate; long-surviving Twitter spam accounts; social networking Websites; Manuals; Twitter; Uniform resource locators; Twitter spam; URL rate; detection; effective features; interaction rate;
Conference_Titel :
Advanced Communication Technology (ICACT), 2013 15th International Conference on
Conference_Location :
PyeongChang
Print_ISBN :
978-1-4673-3148-7