• DocumentCode
    3466951
  • Title

    If We Want Your Opinion

  • Author

    Bikel, Daniel M. ; Sorensen, Jeffrey

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights
  • fYear
    2007
  • fDate
    17-19 Sept. 2007
  • Firstpage
    493
  • Lastpage
    500
  • Abstract
    Sentiment has traditionally been considered a "deep" attribute of writing, often requiring the interpretation of figurative language to uncover the writer\´s intention. The natural language processing community has become increasingly interested in detecting, through automatic means, the expression of opinions and measuring the intensity of emotions held by the writer. Despite the depth and abstraction often associated with expressions of sentiment, we apply strictly lexical analysis to the opinions expressed about books and find that machine learning techniques are capable of resolving even fine-grained distinctions between opinions. Using an averaged perceptron classifier trained using a word subsequence kernel, we achieve an accuracy of 89% when distinguishing between 1- and 5-star reviews. Further, this same model yields significant separation when scoring intermediate reviews - making distinctions even human annotators find difficult. We detail the collection of data for supervised training and present the results of our sentiment classifier along with some discussion about why we believe this approach to be effective.
  • Keywords
    classification; emotion recognition; learning (artificial intelligence); natural language processing; perceptrons; text analysis; emotion detection; lexical analysis; machine learning; natural language processing; perceptron classifier; sentiment classifier; supervised training; text classification; word subsequence kernel; Blogs; Books; Explosions; Humans; Kernel; Machine learning; Materials testing; Motion pictures; Natural language processing; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing, 2007. ICSC 2007. International Conference on
  • Conference_Location
    Irvine, CA
  • Print_ISBN
    978-0-7695-2997-4
  • Type

    conf

  • DOI
    10.1109/ICSC.2007.81
  • Filename
    4338386