Title :
Multilingual Sentiment Classification on Large Textual Data
Author :
Polpinij, Jantima
Author_Institution :
Dept. of Comput. Sci., Mahasarakham Univ., Mahasarakham, Thailand
Abstract :
At present, Big Data have been created lot of buzz in the technology world. Sentiment Analysis or opinion mining is one of the important applications of \´Big Data\´, where sentiment analysis is used for recognising voice or response of crowd for products, services. This concept describes the items in some detail and evaluate them as good/bad, preferred/not preferred. The results are very important for a company because customer feedback can yield extremely valuable insights about a company\´s customer. However, in a commercial website of product reviews, many customers can access to describe the items in some detail and evaluate them with different languages. Therefore, many companies will gather customer feedback in multiple languages. Definitely, feedback in multiple languages raises problems in analysing the material. As this, this paper proposes a solution to classify a product review dataset into two classes: positive and negative sentiments. The proposed methodology is called "Multilingual Sentiment Classification (MSC)". It consists of two main processing steps: lingual separation and sentiment classification. The first main processing step is to classify online product reviews into language classes. The second processing step is to classify each textual dataset into two classes: positive and negative sentiments. It is noted, we concentrate and experiment on bilingual texts (Thai and English).
Keywords :
Big Data; data mining; natural language processing; text analysis; Big Data; MSC; commercial Website; customer feedback; large textual data; lingual separation; multilingual sentiment classification; negative sentiments; opinion mining; positive sentiments; product reviews; sentiment analysis; Accuracy; Big data; Companies; Kernel; Large scale integration; Sentiment analysis; Support vector machines; Big Data; Bilingual Text; Multiple Language; Product Reviews; Sentiment Classification;
Conference_Titel :
Big Data and Cloud Computing (BdCloud), 2014 IEEE Fourth International Conference on
Conference_Location :
Sydney, NSW
DOI :
10.1109/BDCloud.2014.15