Sentiment analysis for various SNS media using Naïve Bayes classifier and its application to flaming detection

Author

Yoshida, Sigeru ; Kitazono, Jun ; Ozawa, Seiichi ; Sugawara, Toshiki ; Haga, Tatsuya ; Nakamura, Shigenari

Author_Institution

Grad. Sch. of Eng., Kobe Univ., Kobe, Japan

fYear

2014

fDate

9-12 Dec. 2014

Firstpage

1

Lastpage

6

Abstract

SNS is one of the most effective communication tools and it has brought about drastic changes in our lives. Recently, however, a phenomenon called flaming or backlash becomes an imminent problem to private companies. A flaming incident is usually triggered by thoughtless comments/actions on SNS, and it sometimes ends up damaging to the company´s reputation seriously. In this paper, in order to prevent such unexpected damage to the company´s reputation, we propose a new approach to sentiment analysis using a Naïve Bayes classifier, in which the features of tweets/comments are selected based on entropy-based criteria and an empirical rule to capture negative expressions. In addition, we propose a semi-supervised learning approach to relabeling noisy training data, which come from various SNS media such as Twitter, Facebook, blogs and a Japanese textboard called `2-channel´. In the experiments, we use four data sets of users´ comments, which were posted to different SNS media of private companies. The experimental results show that the proposed Naïve Bayes classifier model has good performance for different SNS media, and a semi-supervised learning effectively works for the data consisting of long comments. In addition, the proposed method is applied to detect flaming incidents, and we show that it is successfully detected.

Keywords

Bayes methods; entropy; feature selection; learning (artificial intelligence); natural language processing; pattern classification; social networking (online); text analysis; 2-channel; Facebook; Japanese textboard; SNS media; Twitter; backlash; blogs; comments feature selection; communication tools; company reputation; entropy-based criteria; flaming incident detection; naive Bayes classifier model; noisy training data; semisupervised learning approach; sentiment analysis; tweets feature selection; Companies; Dictionaries; Entropy; Media; Semisupervised learning; Support vector machines; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Computational Intelligence in Big Data (CIBD), 2014 IEEE Symposium on

Conference_Location

Orlando, FL

Type

conf

DOI

10.1109/CIBD.2014.7011523

Filename

7011523