• DocumentCode
    3756757
  • Title

    TubeSpam: Comment Spam Filtering on YouTube

  • Author

    T?lio C. ;Johannes V. Lochter;Tiago A. Almeida

  • Author_Institution
    Dept. of Comput. Sci., Fed. Univ. of Sao Carlos, Sorocaba, Brazil
  • fYear
    2015
  • Firstpage
    138
  • Lastpage
    143
  • Abstract
    The profitability promoted by Google in its brand new video distribution platform YouTube has attracted an increasing number of users. However, such success has also attracted malicious users, which aim to self-promote their videos or disseminate viruses and malwares. Since YouTube offers limited tools for comment moderation, the spam volume is shockingly increasing which lead owners of famous channels to disable the comments section in their videos. Automatic comment spam filtering on YouTube is a challenge even for established classification methods, since the messages are very short and often rife with slangs, symbols and abbreviations. In this work, we have evaluated several top-performance classification techniques for such purpose. The statistical analysis of results indicate that, with 99.9% of confidence level, decision trees, logistic regression, Bernoulli Naive Bayes, random forests, linear and Gaussian SVMs are statistically equivalent. Based on this, we have also offered the TubeSpam - an accurate online system to filter comments posted on YouTube.
  • Keywords
    "YouTube","Unsolicited electronic mail","Blogs","Radio frequency","Google","Decision trees"
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2015 IEEE 14th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICMLA.2015.37
  • Filename
    7424299