Title :
Statistical text analysis and sentiment classification in social media
Author :
Cho, Sang-Hyun ; Kang, Hang-Bong
Author_Institution :
Dept. of Comput. Eng., Catholic Univ. of Korea, Bucheon, South Korea
Abstract :
In this paper, we propose a new method of classifying tendencies and opinions in texts of multiple sentence length extracted from social media and covering both formal and informal vocabularies. To extract contextual information from the texts, we carry out computations based on keywords, the position of the sentence and the flow of sentiments in the multiple texts. A feature vector for the given text is constructed from the contextual information, and is then classified with a Support Vector Machine (SVM) classifier as positive, negative or neutral. Our method performs well in classifying the gradient of sentiments expressed in social media.
Keywords :
pattern classification; social networking (online); statistical analysis; support vector machines; text analysis; vocabulary; contextual information extraction; feature vector; formal vocabularies; informal vocabularies; opinion classification; sentiment classification; social media; statistical text analysis; support vector machine classifier; tendency classification; Consumer products; Dictionaries; Media; Motion pictures; Support vector machine classification; Vocabulary; SVM; social media; text sentiment;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
DOI :
10.1109/ICSMC.2012.6377880