• DocumentCode
    3439264
  • Title

    Multi-Class Sentiment Analysis with Clustering and Score Representation

  • Author

    Farhadloo, Mohsen ; Rolland, Eric

  • Author_Institution
    Sch. of Eng., Ernest & Julio Gallo Manage. Program & EECS Group, Univ. of California, Merced, Merced, CA, USA
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    904
  • Lastpage
    912
  • Abstract
    Sentiment analysis or opinion mining is the field of computational study of people´s opinion expressed in written language or text. Sentiment analysis brings together various research areas such as natural language processing, data mining and text mining, and is fast becoming of major importance to organizations as they integrate online commerce into their operations. This paper proposes improved methods for aspect-level sentiment analysis. We propose to utilize bag of nouns instead of bog of words to improve the clustering results for aspect identification and a new feature set, score representation, that leads to more accurate sentiment identification. This scheme is based upon the three scores (positive ness, neutral ness and negative ness) that are learned from the data for each term. Using this new score representation scheme, we improve the performance of 3-class sentiment analysis on sentences by 20% in terms of f1-measure, as compared to previously published research. We demonstrate the usefulness of the methodology using data from the popular online travel information site TripAdvisor.com.
  • Keywords
    data mining; information analysis; learning (artificial intelligence); pattern clustering; 3-class sentiment analysis; TripAdvisor.com; aspect identification; aspect-level sentiment analysis; bag-of-nouns; bag-of-words; clustering; computational study; data mining; f1-measure; feature set; learning; multiclass sentiment analysis; natural language processing; negativeness score; neutralness score; online commerce; online travel information site; opinion mining; people opinion; positiveness score; score representation; sentiment analysis; sentiment identification; text mining; Clustering algorithms; Data mining; Feature extraction; Organizations; Support vector machines; Vectors; Vocabulary; Sentiment Analysis; Text Mining; User Reviews;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • Print_ISBN
    978-1-4799-3143-9
  • Type

    conf

  • DOI
    10.1109/ICDMW.2013.63
  • Filename
    6754018