• DocumentCode
    731005
  • Title

    Asymmetric self-learning for tackling Twitter Spam Drift

  • Author

    Chao Chen ; Jun Zhang ; Yang Xiang ; Wanlei Zhou

  • Author_Institution
    Sch. of Inf. Technol., Deakin Univ., Burwood, VIC, Australia
  • fYear
    2015
  • fDate
    April 26 2015-May 1 2015
  • Firstpage
    208
  • Lastpage
    213
  • Abstract
    Spam has become a critical problem on Twitter. In order to stop spammers, security companies apply blacklisting services to filter spam links. However, over 90% victims will visit a new malicious link before it is blocked by blacklists. To eliminate the limitation of blacklists, researchers have proposed a number of statistical features based mechanisms, and applied machine learning techniques to detect Twitter spam. In our labelled large dataset, we observe that the statistical properties of spam tweets vary over time, and thus the performance of existing ML based classifiers are poor. This phenomenon is referred as “Twitter Spam Drift”. In order to tackle this problem, we carry out deep analysis of 1 million spam tweets and 1 million non-spam tweets, and propose an asymmetric self-learning (ASL) approach. The proposed ASL can discover new information of changed tweeter spam and incorporate it into classifier training process. A number of experiments are performed to evaluate the ASL approach. The results show that the ASL approach can be used to significantly improve the spam detection accuracy of using traditional ML algorithms.
  • Keywords
    learning (artificial intelligence); pattern classification; social networking (online); unsolicited e-mail; ASL approach; ML algorithms; ML based classifiers; Twitter spam drift; asymmetric self-learning; blacklisting services; classifier training process; filter spam links; machine learning techniques; malicious link; statistical features based mechanisms; Feature extraction; Market research; Security; Training; Twitter; Uniform resource locators; Unsolicited electronic mail;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications Workshops (INFOCOM WKSHPS), 2015 IEEE Conference on
  • Conference_Location
    Hong Kong
  • Type

    conf

  • DOI
    10.1109/INFCOMW.2015.7179386
  • Filename
    7179386