DocumentCode
679954
Title
A term weighting method for identifying emotions from text content
Author
De Silva, Jenomi ; Haddela, P.S.
Author_Institution
Dept. of Inf. Technol., Sri Lanka Inst. of Inf. Technol., Malabe, Sri Lanka
fYear
2013
fDate
17-20 Dec. 2013
Firstpage
381
Lastpage
386
Abstract
Since the inception of the concept of social networking, communication patterns have shifted drastically with the unmitigated trend in socializing over the Internet, especially when people began connecting via mobile devices. Nowadays people tend to use these modern communication systems to share their emotions with each other. Human emotions play a vital role in human relationships and people share their emotions through facial expressions, gestures, speech and text messages. However, text messaging is the most common and widely accepted method to exchange information among peers through the Internet and mobile networks. In comparison to other methods, identifying emotions from text messages is rather difficult for the recipient. Therefore, the need of automating the emotion recognition from textual content has increased. Utilization of text classification techniques can be considered as the most common approach of identifying emotions from textual content. Prior to applying a text classifier, the textual data should be transformed into a data structure that the classifier understands by conforming to a document representation model and term weighting method. For this research Vector Space Model (VSM) is used as the document representation model. This paper proposes an extension to the Term Frequency - Inverse Document Frequency (TF-IDF) weighting method to increase classification accuracy and explains experiments conducted to discover the best term weighting method in vector space to be used in feature (text term) extraction from Aman´s emotion text corpus. The text classification is done using Oracle´s ODM SVM tool and LibSVM tool.
Keywords
Internet; data structures; emotion recognition; mobile computing; pattern classification; social networking (online); support vector machines; text analysis; Aman emotion text corpus; Internet; LibSVM tool; Oracle ODM SVM tool; TF-IDF; VSM; classification accuracy; communication patterns; data structure; emotion identification; facial expressions; gestures; human relationships; mobile devices; modern communication systems; social networking; speech; term frequency-inverse document frequency; term weighting method; text classification techniques; text content; text messages; vector space model; Accuracy; Classification algorithms; Emotion recognition; Feature extraction; Support vector machine classification; Text categorization; Support Vector Machine; Term weighting; Text Classification; Vector Space Model;
fLanguage
English
Publisher
ieee
Conference_Titel
Industrial and Information Systems (ICIIS), 2013 8th IEEE International Conference on
Conference_Location
Peradeniya
Print_ISBN
978-1-4799-0908-7
Type
conf
DOI
10.1109/ICIInfS.2013.6732014
Filename
6732014
Link To Document