Title :
Learning Domain-Specific Polarity Lexicons
Author :
Demiroz, Gulsen ; Yanikoglu, Benin ; Tapucu, D. ; Saygin, Yucel
Author_Institution :
Fac. of Eng. & Natural Sci., Sabanci Univ., Istanbul, Turkey
Abstract :
Sentiment analysis aims to automatically estimate the sentiment in a given text as positive or negative. Polarity lexicons, often used in sentiment analysis, indicate how positive or negative each term in the lexicon is. However, since creating domain-specific polarity lexicons is expensive and time consuming, researchers often use a general purpose or domain independent lexicon. In this work, we address the problem of adapting a general purpose polarity lexicon to a specific domain and propose a simple yet effective adaptation algorithm. We experimented with two sets of reviews from the hotel and movie domains and observed that while our adaptation techniques changed the polarity values for only a small set of words, the overall test accuracy increased significantly: 77% to 83% in the hotel dataset and 61% to 66% in the movie dataset.
Keywords :
data mining; learning (artificial intelligence); natural language processing; text analysis; adaptation algorithm; automatic sentiment estimation; domain-specific polarity lexicon learning; general purpose polarity lexicon; hotel dataset; hotel reviews; movie dataset; movie reviews; natural language processing; sentiment analysis; text analysis; Accuracy; Computational linguistics; Databases; Feature extraction; Motion pictures; Training; USA Councils; lexicon adaptation; machine learning; natural language processing; polarity detection; sentiment analysis;
Conference_Titel :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
DOI :
10.1109/ICDMW.2012.120