DocumentCode
3466951
Title
If We Want Your Opinion
Author
Bikel, Daniel M. ; Sorensen, Jeffrey
Author_Institution
IBM T. J. Watson Res. Center, Yorktown Heights
fYear
2007
fDate
17-19 Sept. 2007
Firstpage
493
Lastpage
500
Abstract
Sentiment has traditionally been considered a "deep" attribute of writing, often requiring the interpretation of figurative language to uncover the writer\´s intention. The natural language processing community has become increasingly interested in detecting, through automatic means, the expression of opinions and measuring the intensity of emotions held by the writer. Despite the depth and abstraction often associated with expressions of sentiment, we apply strictly lexical analysis to the opinions expressed about books and find that machine learning techniques are capable of resolving even fine-grained distinctions between opinions. Using an averaged perceptron classifier trained using a word subsequence kernel, we achieve an accuracy of 89% when distinguishing between 1- and 5-star reviews. Further, this same model yields significant separation when scoring intermediate reviews - making distinctions even human annotators find difficult. We detail the collection of data for supervised training and present the results of our sentiment classifier along with some discussion about why we believe this approach to be effective.
Keywords
classification; emotion recognition; learning (artificial intelligence); natural language processing; perceptrons; text analysis; emotion detection; lexical analysis; machine learning; natural language processing; perceptron classifier; sentiment classifier; supervised training; text classification; word subsequence kernel; Blogs; Books; Explosions; Humans; Kernel; Machine learning; Materials testing; Motion pictures; Natural language processing; Writing;
fLanguage
English
Publisher
ieee
Conference_Titel
Semantic Computing, 2007. ICSC 2007. International Conference on
Conference_Location
Irvine, CA
Print_ISBN
978-0-7695-2997-4
Type
conf
DOI
10.1109/ICSC.2007.81
Filename
4338386
Link To Document