DocumentCode
659246
Title
Sentiment Analysis
Author
Bhattacharyya, P.
Author_Institution
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol., Mumbai, Mumbai, India
fYear
2013
fDate
13-14 Sept. 2013
Abstract
Summary form only given. Sentiment analysis is an exciting new field of research in Artificial Intelligence combining Natural Language Processing, Machine Learning and Psychology. Since 2000, due to the proliferation of huge amounts of opinions in electronic form on the web, on social networks and on blogs, automatic means of polarity (positive, negative and neutral) detection in texts flourished in leaps and bounds. Individual and organizations with public interface can no longer afford to be oblivious of sentiments expressed about them in electronic form. In the present tutorial, we will first discuss the foundations of sentiment analysis, covering knowledge based and machine learning based techniques. Feature engineering forms an important part of the task. Starting from word level features we move to more sophisticated text units and explore their efficacy for SA. After this, we describe our research on sentiment analysis that attempts to advance the state of the art by tackling new text form (Tweets), a new task (Thwarting), new language (Indian languages) and new features (word senses instead of words). Tweets are noisy texts; they need text cleaning and normalization. Exploitation of discourse relations like `although´, `still´, `but´ and so on improve the accuracy considerably. Languages differ in terms of annotated resources. However, projecting the parameters learnt from one language to another can ameliorate the problem of resource scarcity. We show the use of these ideas for sentiment analysis of Indian languages. We demonstrate that working with senses instead of just words is not only advantageous from the point of view of multilinguality, but also for accuracy. Finally, problems like thwarting (reversal of polarity just by a single statement at a critical place in the text) and sarcasm are hard problems. We present some of our findings in these tasks too.
Keywords
Internet; learning (artificial intelligence); natural language processing; social networking (online); text analysis; Indian languages; Web; artificial intelligence; blogs; electronic form; knowledge based technique; machine learning based technique; natural language processing; polarity detection; psychology; resource scarcity problem; sentiment analysis; social networks; text cleaning; text normalization; text units; thwarting; tweets; word level features; word senses; Abstracts; Accuracy; Computer science; Learning (artificial intelligence); Natural language processing; Psychology; Tutorials;
fLanguage
English
Publisher
ieee
Conference_Titel
Emerging Trends and Applications in Computer Science (ICETACS), 2013 1st International Conference on
Conference_Location
Shillong
Print_ISBN
978-1-4673-5249-9
Type
conf
DOI
10.1109/ICETACS.2013.6691379
Filename
6691379
Link To Document