Title :
Sentiment analysis for various SNS media using Naïve Bayes classifier and its application to flaming detection
Author :
Yoshida, Sigeru ; Kitazono, Jun ; Ozawa, Seiichi ; Sugawara, Toshiki ; Haga, Tatsuya ; Nakamura, Shigenari
Author_Institution :
Grad. Sch. of Eng., Kobe Univ., Kobe, Japan
Abstract :
SNS is one of the most effective communication tools and it has brought about drastic changes in our lives. Recently, however, a phenomenon called flaming or backlash becomes an imminent problem to private companies. A flaming incident is usually triggered by thoughtless comments/actions on SNS, and it sometimes ends up damaging to the company´s reputation seriously. In this paper, in order to prevent such unexpected damage to the company´s reputation, we propose a new approach to sentiment analysis using a Naïve Bayes classifier, in which the features of tweets/comments are selected based on entropy-based criteria and an empirical rule to capture negative expressions. In addition, we propose a semi-supervised learning approach to relabeling noisy training data, which come from various SNS media such as Twitter, Facebook, blogs and a Japanese textboard called `2-channel´. In the experiments, we use four data sets of users´ comments, which were posted to different SNS media of private companies. The experimental results show that the proposed Naïve Bayes classifier model has good performance for different SNS media, and a semi-supervised learning effectively works for the data consisting of long comments. In addition, the proposed method is applied to detect flaming incidents, and we show that it is successfully detected.
Keywords :
Bayes methods; entropy; feature selection; learning (artificial intelligence); natural language processing; pattern classification; social networking (online); text analysis; 2-channel; Facebook; Japanese textboard; SNS media; Twitter; backlash; blogs; comments feature selection; communication tools; company reputation; entropy-based criteria; flaming incident detection; naive Bayes classifier model; noisy training data; semisupervised learning approach; sentiment analysis; tweets feature selection; Companies; Dictionaries; Entropy; Media; Semisupervised learning; Support vector machines; Training data;
Conference_Titel :
Computational Intelligence in Big Data (CIBD), 2014 IEEE Symposium on
Conference_Location :
Orlando, FL
DOI :
10.1109/CIBD.2014.7011523