• DocumentCode
    1800029
  • Title

    Sentiment analysis for various SNS media using Naïve Bayes classifier and its application to flaming detection

  • Author

    Yoshida, Sigeru ; Kitazono, Jun ; Ozawa, Seiichi ; Sugawara, Toshiki ; Haga, Tatsuya ; Nakamura, Shigenari

  • Author_Institution
    Grad. Sch. of Eng., Kobe Univ., Kobe, Japan
  • fYear
    2014
  • fDate
    9-12 Dec. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    SNS is one of the most effective communication tools and it has brought about drastic changes in our lives. Recently, however, a phenomenon called flaming or backlash becomes an imminent problem to private companies. A flaming incident is usually triggered by thoughtless comments/actions on SNS, and it sometimes ends up damaging to the company´s reputation seriously. In this paper, in order to prevent such unexpected damage to the company´s reputation, we propose a new approach to sentiment analysis using a Naïve Bayes classifier, in which the features of tweets/comments are selected based on entropy-based criteria and an empirical rule to capture negative expressions. In addition, we propose a semi-supervised learning approach to relabeling noisy training data, which come from various SNS media such as Twitter, Facebook, blogs and a Japanese textboard called `2-channel´. In the experiments, we use four data sets of users´ comments, which were posted to different SNS media of private companies. The experimental results show that the proposed Naïve Bayes classifier model has good performance for different SNS media, and a semi-supervised learning effectively works for the data consisting of long comments. In addition, the proposed method is applied to detect flaming incidents, and we show that it is successfully detected.
  • Keywords
    Bayes methods; entropy; feature selection; learning (artificial intelligence); natural language processing; pattern classification; social networking (online); text analysis; 2-channel; Facebook; Japanese textboard; SNS media; Twitter; backlash; blogs; comments feature selection; communication tools; company reputation; entropy-based criteria; flaming incident detection; naive Bayes classifier model; noisy training data; semisupervised learning approach; sentiment analysis; tweets feature selection; Companies; Dictionaries; Entropy; Media; Semisupervised learning; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Big Data (CIBD), 2014 IEEE Symposium on
  • Conference_Location
    Orlando, FL
  • Type

    conf

  • DOI
    10.1109/CIBD.2014.7011523
  • Filename
    7011523