DocumentCode
3439264
Title
Multi-Class Sentiment Analysis with Clustering and Score Representation
Author
Farhadloo, Mohsen ; Rolland, Eric
Author_Institution
Sch. of Eng., Ernest & Julio Gallo Manage. Program & EECS Group, Univ. of California, Merced, Merced, CA, USA
fYear
2013
fDate
7-10 Dec. 2013
Firstpage
904
Lastpage
912
Abstract
Sentiment analysis or opinion mining is the field of computational study of people´s opinion expressed in written language or text. Sentiment analysis brings together various research areas such as natural language processing, data mining and text mining, and is fast becoming of major importance to organizations as they integrate online commerce into their operations. This paper proposes improved methods for aspect-level sentiment analysis. We propose to utilize bag of nouns instead of bog of words to improve the clustering results for aspect identification and a new feature set, score representation, that leads to more accurate sentiment identification. This scheme is based upon the three scores (positive ness, neutral ness and negative ness) that are learned from the data for each term. Using this new score representation scheme, we improve the performance of 3-class sentiment analysis on sentences by 20% in terms of f1-measure, as compared to previously published research. We demonstrate the usefulness of the methodology using data from the popular online travel information site TripAdvisor.com.
Keywords
data mining; information analysis; learning (artificial intelligence); pattern clustering; 3-class sentiment analysis; TripAdvisor.com; aspect identification; aspect-level sentiment analysis; bag-of-nouns; bag-of-words; clustering; computational study; data mining; f1-measure; feature set; learning; multiclass sentiment analysis; natural language processing; negativeness score; neutralness score; online commerce; online travel information site; opinion mining; people opinion; positiveness score; score representation; sentiment analysis; sentiment identification; text mining; Clustering algorithms; Data mining; Feature extraction; Organizations; Support vector machines; Vectors; Vocabulary; Sentiment Analysis; Text Mining; User Reviews;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location
Dallas, TX
Print_ISBN
978-1-4799-3143-9
Type
conf
DOI
10.1109/ICDMW.2013.63
Filename
6754018
Link To Document